Lesson 11 discussion and wiki

sgugger · April 11, 2019, 2:19am

In notebook 6. Little trick: typing a class or function name in a cell gives you where it comes from.

jcatanza · April 11, 2019, 2:20am

Regarding processors that map categories to numerical labels: How do you handle the case of online streaming data, which may contain new categories not seen yet in previous data used for training?

mediocrates · April 11, 2019, 2:20am

FWIW, the latest 08.ipynb says “We use the ListContainer class from 08…”

nswitanek · April 11, 2019, 2:22am

Sorry if I missed this, but does the split function check to make sure every label has corresponding images in both training and validation sets?

yonatan365 · April 11, 2019, 2:22am

in LabeledData class, what is the @classmethod decorator doing?

champs.jaideep · April 11, 2019, 2:23am

has pytorch got any such inbuilt split functionality by ids/funcs ?

Interogativ · April 11, 2019, 2:23am

static method

piotr.czapla · April 11, 2019, 2:24am

Anyone is a bit worried that the creation of the vocab is implicit? If you reorder labeling lines (training with valid) you get different label values.
This isn’t a problem if you train but if you think about inference you might just get that wrong easily.

KevinB · April 11, 2019, 2:24am

Creating an “Other” class kind of sounds like the “None” that we talked about last week that didn’t work.

marii · April 11, 2019, 2:24am

Another way to not do much better than random is normalizing the validation set by its own standard deviation and mean.

harikrishnanrajeev · April 11, 2019, 2:25am

do you look for distribution of classes and see whether its balanced ?

sgugger · April 11, 2019, 2:25am

A PR to fix that typo would be welcome

sgugger · April 11, 2019, 2:25am

Not if you don’t make it do so.

sgugger · April 11, 2019, 2:26am

That’s why the vocab is sorted by alphabetical order.

ThomM · April 11, 2019, 2:27am

I thought we were talking last week about how “other” categories were tricky because we’re effectively asking the classifier to detect things that are positively aspects of negatively being a thing… I haven’t reviewed so I might be misremembering, though. I wonder when it is & isn’t a good idea to have an “other” category vs. eg. a loss function which would give low weights to low confidence predictions + confidence cutoff when displaying a prediction output. (This is pretty off topic)

jcatanza · April 11, 2019, 2:27am

That is data leakage, also called “data snooping”. It can lead you to overestimate the generalizability of your model.

sgugger · April 11, 2019, 2:27am

At the ned of the day, it’s a design decision for your model. You can decide that the target for other is everything at 0.

harikrishnanrajeev · April 11, 2019, 2:27am

do we always need to convert them to same size ? is that a requirement ?

sgugger · April 11, 2019, 2:28am

You can’t batch them if they aren’t all of the same size.

nswitanek · April 11, 2019, 2:29am

Given a distribution of image sizes, how do you choose the dimensions to resize all images to?