You should check the code in the new notebook lesson5-rossmann. It’s not ready for class yet, but you have a few elements, mainly:
data = (TabularList.from_df(df, path=path, cat_names=cat_names, cont_names=cont_names, procs=procs)
.split_by_idx(valid_idx)
.label_from_df(cols=dep_var, label_cls=FloatList, log=True)
.databunch())
for doing your data. The label_cls=FloatList
is the thing that will force regression, you can remove log=True
in your case since you don’t want the log of the targets I believe.