The goal is for the model to use a user ID and a movie ID to predict a user's rating for that movie.
Were you training the language model from scratch, or fine-tuning it as in the class example? Is it possible to add words when fine-tuning? (In general, not fastai specifically.)
It’s going to be just as skewed after normalizing, so that doesn’t solve the problem.
A given user for a future movie or any user for any movie?
That works if you know how to translate jajaja (Spanish for “hahaha”)
to label. That’s OK. But if you want to use a token for names, you need to recognize it as a name.
I think you can transfer a categorical embedding from model to model. Say your company previously trained a model with a feature ‘store’; if you are now training a totally different model that also uses the feature ‘store’, you can grab the embedding from the previous model and use it to initialize the store embedding of the new one. Jeremy mentioned a similar example in a previous version of the course.
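A minimal sketch of that idea in PyTorch, assuming both models use the same ‘store’ vocabulary and embedding size (the names and sizes here are made up for illustration):

```python
import torch
import torch.nn as nn

n_stores, emb_dim = 50, 8  # hypothetical vocabulary size and embedding width

# Embedding from a previously trained model (imagine it holds learned weights).
old_store_emb = nn.Embedding(n_stores, emb_dim)

# A totally different model's fresh 'store' embedding.
new_store_emb = nn.Embedding(n_stores, emb_dim)

# Initialize the new embedding with the trained weights.
with torch.no_grad():
    new_store_emb.weight.copy_(old_store_emb.weight)
```

From here the new model trains as usual; the embedding just starts from the old model’s learned representation instead of random initialization.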
You are creating a model where you give it a movie ID and a user ID and get a rating as output.
In practice, it is typically used to predict ratings for a movie someone hasn’t seen yet (to predict what they might like so you can make recommendations).
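The simplest version of such a model is a dot product of user and movie embeddings; a minimal sketch (sizes and factor count are arbitrary assumptions):

```python
import torch
import torch.nn as nn

class DotProductModel(nn.Module):
    """Predict a rating from a user ID and a movie ID."""
    def __init__(self, n_users, n_movies, n_factors=5):
        super().__init__()
        self.user_emb = nn.Embedding(n_users, n_factors)
        self.movie_emb = nn.Embedding(n_movies, n_factors)

    def forward(self, user_ids, movie_ids):
        # Predicted rating = dot product of user and movie latent factors.
        return (self.user_emb(user_ids) * self.movie_emb(movie_ids)).sum(dim=1)

model = DotProductModel(n_users=100, n_movies=200)
ratings = model(torch.tensor([0, 1]), torch.tensor([10, 20]))  # one rating per pair
```

After training on known (user, movie, rating) triples, you can score movies a user hasn’t seen and recommend the highest-scoring ones.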
Is there a source to learn more about the cold start problem?
Maybe within the realm of things like chatbots or word spelling, where you need to predict the word?
I am confused about how this collaborative filtering is different from tabular. Is it just a special case?
Emojis could be converted to words, maybe using some mapping, before feeding them into the NN.
Yes, you need to recognize the tokens somehow to identify them.
Do you ever have to take into consideration that you have multiple samples/observations per subject with deep learning? e.g. when you have multiple movie reviews from the same person, or when you have multiple images from the same brain or slices from the same MRI for classification, or do neural nets not care?
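It usually matters for validation: if slices from the same subject land in both train and validation sets, scores are inflated. One common fix is a group-aware split; here is a sketch using scikit-learn’s GroupKFold (the data and subject IDs are made up):

```python
import numpy as np
from sklearn.model_selection import GroupKFold

# Hypothetical data: 8 samples from 4 subjects (e.g. MRI slices per brain).
X = np.arange(8).reshape(8, 1)
y = np.array([0, 1, 0, 1, 0, 1, 0, 1])
subjects = np.array([0, 0, 1, 1, 2, 2, 3, 3])

gkf = GroupKFold(n_splits=2)
for train_idx, val_idx in gkf.split(X, y, groups=subjects):
    # No subject appears on both sides of the split.
    assert set(subjects[train_idx]).isdisjoint(subjects[val_idx])
```

The network itself doesn’t care how the samples relate; the splitting strategy is where the per-subject structure has to be respected.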
Fine-tuning. Yes, that is possible, and easier in fastai since weights are matched internally. For more info see load_pretrained.
Jeremy just mentioned there are different LMs in the zoo for different languages. Do you have something “meta”, like an LM to do language detection first?
An emoji is usually encoded with the word that describes the expression; for example, this one is :joy:. So yes, there is an easy mapping here. Just strip the colons.
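A tiny hand-rolled sketch of that mapping (in practice you might use a library such as emoji and its demojize function; the table here is an assumption, not a full mapping):

```python
# Minimal emoji -> word-token table for illustration only.
EMOJI_TO_WORD = {"😂": ":joy:", "🎂": ":birthday:"}

def demojize(text: str) -> str:
    """Replace each known emoji with its word-style token."""
    for em, name in EMOJI_TO_WORD.items():
        text = text.replace(em, name)
    return text

def strip_colons(token: str) -> str:
    """Turn ':joy:' into the plain word 'joy'."""
    return token.strip(":")

print(demojize("so funny 😂"))        # so funny :joy:
print(strip_colons(":joy:"))          # joy
```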
What’s the role of the timestamp in collaborative filtering? Does it need to know about movie genres or other metadata about the product? Should we consider browsing patterns in collaborative filtering?
Happy birthday, Jeremy!
Happy birthday! 🎂
Happy birthday!