How to best integrate both image and tabular data in a single model using fastai?

jwuphysics · April 8, 2019, 8:33pm

Hi @ytian, hope you’ve been able to make some headway here! I’ve also worked on a similar problem and found some other resources that other forums members have shared. In particular, the guide mentioned in this post was extremely helpful.

I’ve created a Github gist that lays out my own steps (although with specifics redacted), and the link is at the bottom of this post. Hopefully this is useful to you!

One of the best things about the Fastai library is that the layered learning rates are absolutely crucial here. For example, you might create a model that has layer groups like:

     ┌───>    conv1 ───> conv2  ───┐
     │                             │
data ┤                             ├───> final ───> output
     │                             │ 
     └─────────────────> tab1 ─────┘

as is the case in my example. The final + tab1 linear layers are grouped together, and so appropriately can be trained using the same learning rate. For the pretrained conv1 and conv2 layer groups, we may want to use a substantially lower learning rate. Anyway, just my two cents.

gist.github.com

https://gist.github.com/jwuphysics/f19162d38e1b4fed0030b96411186c3a

TabConvLearner.py

from fastai import *
from fastai.tabular import *
from fastai.vision import *

PATH = os.path.abspath('..')

# distinguish categorical and continuous variables, and dependent variable
cat_names = ['cat1', 'cat2', 'cat3']
cont_names =['cont1', 'cont2']
dep_var = 'target'

This file has been truncated. show original