Issues with fine-tuning a language model

I am trying to train a language model on a big dataset and then fine-tune it on a smaller one (as was proposed in lesson 4). When I try to load the trained model (the one trained on the larger dataset), the embedding sizes don’t match, because the larger corpus has a bigger vocabulary. How should I solve this issue?
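
One workaround I am considering, though I am not sure it is the recommended way, is to rebuild the embedding matrix for the smaller vocabulary from the pretrained rows: tokens that exist in both vocabularies reuse their pretrained row, and tokens missing from the larger vocabulary start from the mean of all pretrained rows. Below is a minimal sketch in plain Python. The function name `remap_embeddings` and the toy vocabularies and weights are hypothetical, made up for illustration only; they do not come from the course library.

```python
# Minimal sketch, not a library API: all names and values are hypothetical.
def remap_embeddings(pretrained_rows, pretrained_vocab, new_vocab):
    """Build an embedding matrix for new_vocab from pretrained rows.

    Tokens found in pretrained_vocab reuse their pretrained row.
    Tokens missing from it get the mean of all pretrained rows.
    """
    index = {tok: i for i, tok in enumerate(pretrained_vocab)}
    dim = len(pretrained_rows[0])
    n = len(pretrained_rows)
    # Column-wise mean, used as the starting point for unseen tokens.
    mean_row = [sum(row[d] for row in pretrained_rows) / n for d in range(dim)]

    new_rows = []
    for tok in new_vocab:
        i = index.get(tok)
        if i is not None:
            new_rows.append(list(pretrained_rows[i]))  # copy the known row
        else:
            new_rows.append(list(mean_row))  # unseen token: mean row
    return new_rows


# Toy data: a "large corpus" vocab of 4 tokens with 2-dimensional embeddings.
big_vocab = ["the", "cat", "sat", "mat"]
big_rows = [[1.0, 0.0], [0.0, 1.0], [2.0, 2.0], [1.0, 1.0]]

# The "small corpus" vocab has a different order and one unseen token.
small_vocab = ["cat", "the", "dog"]
small_rows = remap_embeddings(big_rows, big_vocab, small_vocab)
```

The resulting matrix has one row per token in the smaller vocabulary, so its shape would match a model built for that vocabulary. If the output layer shares or mirrors the embedding weights, I assume it would need the same remapping.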