A quick and easy way (and the right approach) is described here: Combining Tabular + Images in fastai2. It should work with almost any other input type too.
Basically, you make two DataLoaders, one for the text and one for the image, then combine them via my MixedDL. It's by far the easiest approach IMO, rather than fighting the API.
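
To give a rough idea of the pattern, here is a dependency-free sketch. It is not the actual MixedDL code, and `SimpleLoader` and `PairedLoader` are made-up names used only for illustration. The point it tries to show is that both loaders have to be driven by the same index order, otherwise the text and image batches stop lining up as soon as you shuffle.

```python
import random


class SimpleLoader:
    """Hypothetical stand-in for one DataLoader (not a fastai class)."""

    def __init__(self, items, bs):
        self.items, self.bs = list(items), bs

    def batches(self, idxs):
        # Yield batches following whatever index order we are handed.
        for i in range(0, len(idxs), self.bs):
            yield [self.items[j] for j in idxs[i:i + self.bs]]


class PairedLoader:
    """Hypothetical wrapper that iterates several loaders in lockstep."""

    def __init__(self, *loaders, shuffle=False, seed=None):
        if len({len(dl.items) for dl in loaders}) != 1:
            raise ValueError("all loaders need the same number of items")
        if len({dl.bs for dl in loaders}) != 1:
            raise ValueError("all loaders need the same batch size")
        self.loaders, self.shuffle = loaders, shuffle
        self.rng = random.Random(seed)

    def __iter__(self):
        idxs = list(range(len(self.loaders[0].items)))
        if self.shuffle:
            self.rng.shuffle(idxs)  # one shuffle, shared by every loader
        # Each step yields a tuple holding one batch per wrapped loader.
        yield from zip(*(dl.batches(idxs) for dl in self.loaders))


# Toy data: sample i has a text item and an image item.
text_dl = SimpleLoader([f"text_{i}" for i in range(6)], bs=4)
image_dl = SimpleLoader([f"image_{i}" for i in range(6)], bs=4)
mixed = PairedLoader(text_dl, image_dl, shuffle=True, seed=0)

for text_batch, image_batch in mixed:
    # Position k in each batch refers to the same underlying sample.
    print(text_batch, image_batch)
```

In a real setup each wrapped loader would also apply its own transforms and collation, which this sketch leaves out on purpose.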