Optimizing text classification for small datasets

Hi.

I’m trying to optimize a text classifier for small subsets of the IMDB dataset (something like the IMDB sample, or even smaller), based on this starting point:

Has anybody tried something like this and could share recommended hyperparameters?

Thanks!
