How can I train a LM with a smaller vocab on the pretrained Wiki LM?

Is it possible to train a custom language model with a smaller vocab (i.e. 10,000 words) on top of your pretrained wiki LM, which has a 60,000-word vocab? When I try to load the weights I get this error.

RuntimeError: Error(s) in loading state_dict for SequentialRNN:
	size mismatch for 0.encoder.weight: copying a param with shape torch.Size([60002, 400]) from checkpoint, the shape in current model is torch.Size([10002, 400]).
	size mismatch for 0.encoder_dp.emb.weight: copying a param with shape torch.Size([60002, 400]) from checkpoint, the shape in current model is torch.Size([10002, 400]).
	size mismatch for 1.decoder.weight: copying a param with shape torch.Size([60002, 400]) from checkpoint, the shape in current model is torch.Size([10002, 400]).
	size mismatch for 1.decoder.bias: copying a param with shape torch.Size([60002]) from checkpoint, the shape in current model is torch.Size([10002]).

Check out the line where you create your TextClasDataBunch. There you can pass a vocab kwarg and, for example, assign data_lm.train_ds.vocab (if those are your variable names). I hope I understood your problem, happy coding :slight_smile:
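
In case it helps, here is a minimal sketch of what I mean, assuming fastai v1 and dataframes called train_df / valid_df with 'text' and 'label' columns (path and the column names are just placeholders, adjust to your setup):

from fastai.text import *

# Language-model data built from your own corpus
data_lm = TextLMDataBunch.from_df(path, train_df, valid_df, text_cols='text')

# LM learner with the pretrained wiki weights; as far as I know fastai v1
# remaps the pretrained embedding rows onto data_lm's (smaller) vocab for you
learn_lm = language_model_learner(data_lm, AWD_LSTM, drop_mult=0.3)

# Reuse the same vocab for the classifier data so the embedding sizes
# match the fine-tuned encoder when you load it later
data_clas = TextClasDataBunch.from_df(
    path, train_df, valid_df,
    text_cols='text', label_cols='label',
    vocab=data_lm.train_ds.vocab, bs=32)

That way the classifier model is built with exactly the same vocab as the language model, so the encoder weights line up and you shouldn't hit the size-mismatch error when loading them.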

Hi, were you able to fix the issue? I’m stuck with the same problem. A lot of people have come across this issue, but it seems nobody has a definitive answer for it.