Transfer learning for tabular data - how to replace the head?

sambit · September 10, 2020, 10:15am

I’m trying to build a multi-label classifier with TabularModel and TabularLearner. There are 206 targets I need to predict. However, 402 additional (auxiliary) targets are also provided (which I don’t need to predict).

The idea:

Train a model for the 402 auxiliary targets first.
Use the above as a pre-trained model & apply transfer learning to then train the final model (206 targets).

The input features for both models are exactly the same.

Strategy:

Create new architecture with same body & new head.
Load the ‘body weights’ of pre-trained model

How do I do this? In particular, how do I load just the body weights?

vferrer · September 18, 2020, 8:24am

Hi @sambit.
I would use the same strategy as cnn_learner:

Load the full pretrained model
Remove the head. In your case, you may want to keep embedding layers only.
Initialize the new layers
Freeze the model (you’ll need to pass a custom cut function)
Train the head
Train all the model

muellerzr · September 18, 2020, 10:47am

Searching the forums would have resulted in this post, with all the answers:

TL:DR, transferring would really only be valuable at the embedding level, considering it the equivalent of our pretrained ResNets