ULMFiT: unequal inference time

dnzzl · February 8, 2019, 2:52pm

Hello everyone,

I’m trying to deploy ULMFiT models for several languages (English, Spanish and French).

I have exactly used the same workflow for all languages, based on sgugger DeepFrench notebook, using related pre-trained weights.

English model is 138 MB, Spanish 170 MB, and French 93 MB.

However, when it comes to inference time, I have some big difference on the same machine in the same conditions:

What can explain this difference? How can I improve this?
Bonus question: Was someone able to export the model with the ONNX format ?

Thank you for your help

sgugger · February 8, 2019, 3:46pm

Is this all with the same exact version of fastai? I pushed things to make the predict method for language models faster recently.

dnzzl · February 8, 2019, 3:48pm

It is indeed, trained with 1.0.43.dev0.

Maybe I’ll wait for the next release to sort out CPU weights and new learner args.

dnzzl · February 13, 2019, 8:48am

Is it possible the issue is not directly related to Fastai but rather with Spacy (#3242) ?

sgugger · February 13, 2019, 2:24pm

It’ is likely since it’s the only differences I can see between languages.