I made one small change to Jeremy’s Kaggle NLP notebook from this:
model_nm = 'microsoft/deberta-v3-small'
To this:
model_nm = 'distilroberta-base'
The tokenizer with this model is different but was still working.
But then I get the following error during the first epoch after running >> trainer.train();
My kaggle notebook with the complete saved error message is here: https://www.kaggle.com/code/mattrosinski/getting-started-with-nlp-for-absolute-beginners
Reverting to 'microsoft/deberta-v3-small'
fixes the problem but why does the trainer fail when using 'distilroberta-base'
?