What difference during Language model training between frozen and unfrozen parts. Which RNN layers are not updated during frozen NN training?
What difference during Language model training between frozen and unfrozen parts. Which RNN layers are not updated during frozen NN training?