I’m working on the Kaggle TMDB challenge. I created a databunch like so
all_data = (TabularList
x, cat_names=cat_names, cont_names=cont_names, procs=procs, )
.label_from_df(cols=dep_var, label_cls=FloatList, log=True)
I was trying to train on all data so I used split_none(). The train dataset in my databunch has 3000 items, which is as expected. I then created and trained a model, which seemed to be working as expected. But when I try to get the predictions…
train_pred, _ = learn.get_preds(DatasetType.Train)
I get back something with length 2944. What could be happening to the other 66 items?