I created MultiTaskLearner, which:
- Doesn't need a single dls for all tasks' dataloading
- Doesn't need you to hack the model to output concatenated outputs for all tasks
- Is, of course, general and not limited to training GLUE tasks
- Prints results beautifully
Finetune_GLUE_with_fastai.ipynb: shows how to use MultiTaskLearner
mutlit_task.py: defines MultiTaskLearner and other classes
Notes on the mechanism:
- Training cycles through the tasks.
- If you don't just switch the output layer of the model for your multi-task learning, just create your own nn.Module class MultiTaskModel and define switch and __len__ methods, so that after model.switch(i), model(x) will use an internal model that can solve task i. It's kind of like a bunch of models; the implementation should be super easy (see the sketch below).
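A minimal sketch of that idea (the shared encoder / per-task heads split and all names besides switch, __len__, and model.switch(i) are illustrative assumptions, not the code in mutlit_task.py):

```python
import torch
import torch.nn as nn

class MultiTaskModel(nn.Module):
    "A bag of per-task models behind one interface: switch(i) picks what model(x) runs."
    def __init__(self, encoder, heads):
        super().__init__()
        self.encoder = encoder             # shared body (illustrative; could be a transformer)
        self.heads = nn.ModuleList(heads)  # one output module per task
        self.cur = 0

    def switch(self, i):
        "Select task i; subsequent forward passes use its head."
        self.cur = i
        return self

    def __len__(self):
        "Number of tasks this model can solve."
        return len(self.heads)

    def forward(self, x):
        return self.heads[self.cur](self.encoder(x))

# Toy usage: one shared body, two task-specific heads.
model = MultiTaskModel(nn.Linear(16, 32), [nn.Linear(32, 2), nn.Linear(32, 3)])
x = torch.randn(4, 16)
model.switch(0); print(model(x).shape)  # torch.Size([4, 2]) -- task 0
model.switch(1); print(model(x).shape)  # torch.Size([4, 3]) -- task 1
```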
Notes on the pasted results:
- I know I should post a result of multi-task training on all GLUE tasks, but Colab ran out of memory, and VS Code connected to our lab server can't print the pretty training result sheet. Please tell me how to solve this dilemma.
- Actually, ELECTRA didn't multi-task train the GLUE tasks, which I learned only after I created MultiTaskLearner.
- Multi-task training is worse than single-task learning here; that may be because the task weights need adjusting, or because GLUE is not suited to multi-task learning (see the generic sketch below).
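For context on "adjusting task weights": a common generic recipe (not code from this repo; the weights and loss functions below are made-up placeholders) is to scale each task's loss before combining them, so that no single task dominates the gradients:

```python
import torch
import torch.nn as nn

# Generic weighted multi-task loss: scale each task's loss before summing.
# Weights and loss functions here are arbitrary placeholders.
task_weights = [1.0, 0.5, 2.0]
loss_funcs = [nn.CrossEntropyLoss(), nn.CrossEntropyLoss(), nn.MSELoss()]

def multi_task_loss(outputs, targets):
    "outputs/targets hold one entry per task; returns the weighted sum of per-task losses."
    return sum(w * f(o, t) for w, f, o, t in zip(task_weights, loss_funcs, outputs, targets))

# Toy check with random predictions and targets for 3 tasks.
outputs = [torch.randn(8, 2), torch.randn(8, 3), torch.randn(8, 1)]
targets = [torch.randint(0, 2, (8,)), torch.randint(0, 3, (8,)), torch.randn(8, 1)]
print(multi_task_loss(outputs, targets))
```

With a learner that cycles through one task per batch, the analogue would simply be multiplying the current task's loss by its weight.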
Previous posts in the "Pretrain MLM and finetune on GLUE with fastai" series:
- TextDataLoader: as fast as or faster, but also with a sliding window, caching, and a progress bar
- Novel Huggingface/nlp integration: train on and show_batch hf/nlp datasets
Also, follow my Twitter (Richard Wang) for updates on this series.
(I am going to reproduce the ELECTRA-small results in Table 8 of the paper, from scratch!!)