Inference Learner for collaborative filtering

Andreas_Daiminger · December 14, 2018, 1:38pm

What’s the best way to try out my trained collab_learner on a test set?

Is this the way to do it?.

preds= learn.get_preds(ds_type=DatasetType.Test)
preds = preds[0].numpy()

I want my test preds as a numpy array.
The test set was specified when creating the CollabDataBunch instance.

data = CollabDataBunch.from_df(data_train, seed=42, path=path, test=data_test)

sgugger · December 14, 2018, 2:24pm

That seems about right.

gevezex · March 12, 2019, 2:03pm

How should the test dataset be provided?
For example if the train set has 1000 users and 1000 movies. And the test set has only 1 user with 999 movie reviews. Should that one movie what is not rated presented with a NaN value?

sgugger · March 12, 2019, 4:36pm

I think it’s the other way round, but any value in your test set that isn’t in your training set should be replaced by NaN.