ULMFiT - French

(Thomas Chambon) #22

I have pushed the movie review classifier notebook to GitHub, along with the weights and vocab of the French LM.
You can download them here: https://github.com/tchambon/deepfrench

I am still waiting for an answer from the DEFT competition organizers to confirm the very good results (new SOTA) on the 4-label tweet classification task.

3 Likes

(Vintila Claudiu) #23

Thank you for sharing! :blush:

0 Likes

(Waleed) #24

Good work!

Could you give us more information about the IMDb-like French movie review dataset?

0 Likes

(Thomas Chambon) #25

The website is called Allocine; I collected the data by web scraping.

2 Likes

(Bruno Seznec) #26

Hello

I am trying to adapt the imdb notebook (fastai v0.7) to use the pretrained weights,
but I get this error:

KeyError: 'unexpected key "0.encoder_dp.emb.weight" in state_dict'

after calling

learner.model.load_state_dict(wgts)

With fastai v1, on the other hand, I also get an error.

Thanks for any hint

Rgds
Bruno

0 Likes
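
An "unexpected key" error from load_state_dict generally means that the parameter names in the saved weights do not match the names the model expects. A minimal sketch for listing the mismatches before loading is given below. It works on plain key lists, so it does not need PyTorch itself. The helper name diff_state_dict_keys is hypothetical, and the example keys are illustrative assumptions: the first is the key from the error message above, the second is the embedding key that the fastai v0.7 imdb notebook writes to, which may differ in other setups.

```python
def diff_state_dict_keys(model_keys, saved_keys):
    """Compare parameter names expected by a model with those in saved weights.

    Returns (missing, unexpected):
      missing    - names the model expects but the saved weights lack
      unexpected - names in the saved weights that the model does not have
    """
    model_keys = list(model_keys)
    saved_keys = list(saved_keys)
    model_set, saved_set = set(model_keys), set(saved_keys)
    missing = [k for k in model_keys if k not in saved_set]
    unexpected = [k for k in saved_keys if k not in model_set]
    return missing, unexpected


# Illustrative keys only (assumptions, not read from the actual files).
example_model_keys = ["0.encoder.weight", "0.encoder_with_dropout.embed.weight"]
example_saved_keys = ["0.encoder.weight", "0.encoder_dp.emb.weight"]

example_missing, example_unexpected = diff_state_dict_keys(
    example_model_keys, example_saved_keys
)
print("missing from saved weights:", example_missing)
print("unexpected in saved weights:", example_unexpected)
```

With a real model, the same helper could be called as diff_state_dict_keys(learner.model.state_dict().keys(), wgts.keys()). If the two lists differ only in naming, one possible explanation is that the weights were saved with a different fastai version than the one loading them.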

(Battle500) #27

Hi man,
I had a look at your GitHub repo; congratulations on a really great job.
Just one question: what exactly does the itosref30.pkl file contain? It should hold only data_lm.vocab.itos, where data_lm is the ‘general’ French language model, right?

0 Likes
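
If the file is, as the question above assumes, a pickled list of tokens (the itos, or "index to string", list), it can be inspected with the standard library alone. The sketch below loads such a list and builds the reverse stoi mapping. The helper names load_itos and build_stoi are hypothetical, and mapping unknown tokens to index 0 is an assumption that should be checked against the actual vocab.

```python
import collections
import pickle


def load_itos(path):
    """Load a pickled index-to-token list from disk."""
    with open(path, "rb") as f:
        itos = pickle.load(f)
    return list(itos)


def build_stoi(itos, unk_index=0):
    """Build a token-to-index mapping; unseen tokens map to unk_index.

    unk_index=0 is an assumption about where the unknown token sits.
    """
    mapping = {token: index for index, token in enumerate(itos)}
    return collections.defaultdict(lambda: unk_index, mapping)
```

For example, itos = load_itos("itosref30.pkl") followed by print(len(itos), itos[:10]) would show the vocab size and its first tokens, which is usually enough to tell whether the file holds only the vocab.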