Pablo
May 24, 2021, 12:36pm
1371
For the moment I’ve found this:
I know … the NLP hypetrain has moved on to HuggingFace and transformers - but I still love ULMFiT.
fastai and ULMFiT are still great - easy to use, fast to train and imho still SOTA for classification. But what was missing (esp. for fastai2 with SentencePiece) were easy to use pretraining scripts and pretrained models. Thats what I am trying to fix with this repo .
About 4 months ago I published a repository to pretrain a German ULMFiT with SentencePiece from scratch - based …
I’m still reading through the details, but at least it contains relevant stuff like SentencePiece tokenization, and the reported metrics are really good, so I’ll definitely give this a try.
Thanks for sharing that, @florianl !
1 Like