It’s close - but it’s just for batchnorm layers; it disables their moving average updates.
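To make that concrete, here's a minimal plain-PyTorch sketch (my own illustration, not fastai's implementation) showing that a batchnorm layer in eval mode stops updating its moving averages, while train mode updates them:

```python
import torch
import torch.nn as nn

bn = nn.BatchNorm2d(3)
before = bn.running_mean.clone()

bn.eval()  # eval mode: use the stored statistics, don't update them
with torch.no_grad():
    bn(torch.randn(16, 3, 8, 8))
frozen = torch.equal(bn.running_mean, before)  # moving average untouched

bn.train()  # train mode: moving averages update on every forward pass
with torch.no_grad():
    bn(torch.randn(16, 3, 8, 8))
updated = not torch.equal(bn.running_mean, before)

print(frozen, updated)  # True True
```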
Yes exactly.
Yes there is, although it requires writing a custom loss function. We won’t cover that in this part of the course. Funnily enough, it turned out in this competition it didn’t help, since sometimes images were incorrectly labeled with two weather labels!
Nice job coming up with this idea BTW - very deep insight.
Generally you want as big as your GPU can handle, but of course no bigger than the original input size.
The other main choice is to have none at all (i.e. a linear output).
`learn.model` contains the PyTorch model.
All the models we’ve used work on any sized input. We’ll learn the details of how later.
No, not much thought at all, I’m afraid… Seems to work OK though!
In fact, you’ll see in `f2` in our planet.py that it searches for a good threshold!
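Here's a rough sketch of the idea (the names and details here are my own, not the actual planet.py code): scan a grid of candidate thresholds and keep the one that maximizes F2 on the validation predictions.

```python
import numpy as np

def f2(preds, targs, threshold):
    # F-beta with beta=2 weights recall four times as much as precision.
    p = preds > threshold
    tp = (p & targs).sum()
    precision = tp / max(p.sum(), 1)
    recall = tp / max(targs.sum(), 1)
    if precision + recall == 0:
        return 0.0
    return 5 * precision * recall / (4 * precision + recall)

def best_threshold(preds, targs, grid=np.arange(0.05, 0.95, 0.01)):
    # Pick the grid point with the highest F2 score.
    return max(grid, key=lambda t: f2(preds, targs, t))

# Toy example: predictions for 3 labels of one image, and the true labels.
preds = np.array([0.9, 0.2, 0.8])
targs = np.array([True, False, True])
t = best_threshold(preds, targs)
```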
I’ve added the lesson video to the top post of this thread. (It’s currently uploading - it will be available within the hour.)
They are added to each lesson’s wiki thread (including this one!) the day after the lesson.
I have 48×48 single-channel images of human facial expressions downloaded from here. It’s an expression classification problem. Should I upsample to use pre-trained models, or should I train a convnet from scratch?
Yes, filters are arrived at via SGD.
Yes, it will be `3*3*3` for a 3-channel input and 1 filter. In the Excel sheet Jeremy uses 2 filters, one for vertical edges and another for horizontal edges; in this case the size of the filter tensor would be `2*3*3*3`. I.e. if K filters are used and the image is 3-channel, then the size of the filter tensor would be `K*3*3*3`.
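For what it's worth, this matches how PyTorch lays out conv weights — a small sketch using `nn.Conv2d` directly:

```python
import torch.nn as nn

# K=2 filters over a 3-channel input, each filter 3x3:
conv = nn.Conv2d(in_channels=3, out_channels=2, kernel_size=3)
print(tuple(conv.weight.shape))  # (2, 3, 3, 3): K x channels x height x width
```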
But as the torchvision resnet model does not have the same final two layers that the fastai framework appends (based on your # of classes), won’t the call to `model.load_state_dict(my_model_state)` throw an exception?
FYI your `*` characters are being interpreted as italics formatting in your post. Wrap like this to avoid the problem: `3*3*3`
Right, that approach won’t work.
What does the `precompute` flag do?

`learn = ConvLearner.pretrained(arch, data, precompute=True)`
Right, that was a generic example. The OP wanted an example without fastai.
We can’t load Torch’s resnet34 model into fastai’s resnet34 model for the reason you stated.
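To illustrate the mismatch, here's a small standalone sketch (toy models of my own, not the actual resnet34 definitions): two networks share a backbone but have different heads, so `load_state_dict` raises because the head weight shapes don't line up.

```python
import torch.nn as nn

def backbone():
    return nn.Sequential(nn.Linear(8, 8), nn.ReLU())

model_a = nn.Sequential(backbone(), nn.Linear(8, 1000))  # e.g. ImageNet head
model_b = nn.Sequential(backbone(), nn.Linear(8, 17))    # e.g. 17-class head

try:
    model_b.load_state_dict(model_a.state_dict())
    raised = False
except RuntimeError:
    raised = True  # size mismatch in the final layer's weights

print(raised)  # True
```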
Search the forum - we’ve discussed that one quite a bit.