Lesson 2 In-Class Discussion

When you say to start with smaller images, do you mean cropped versions of the original ones, and then switch to the normal ones?

These models don’t have an “input size”; any size works.


What is cycle_mult? I forget what it does exactly.

When you are using the whole dataset for training at the end, how do you know you are not overfitting?

It’s the factor by which the number of epochs in each successive cycle is multiplied.
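
For reference, a minimal sketch with the fastai 0.7 API used in the course (the `data` object is assumed to have been created earlier, e.g. with ImageClassifierData):

```python
from fastai.conv_learner import *  # course-style import (fastai 0.7)

# `data` is assumed to exist already.
learn = ConvLearner.pretrained(resnet34, data, precompute=True)

# 3 cycles of SGDR with cycle_len=1 and cycle_mult=2: the cycles last
# 1, 2 and 4 epochs (7 epochs total), and the learning rate restarts
# at the beginning of each cycle.
learn.fit(1e-2, 3, cycle_len=1, cycle_mult=2)
```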

Does making copies of the rare cases give the neural network a false sense of how likely that event is, or cause it to overfit on those specific examples?

e.g. you have 1 pink unicorn but pretend you have 100 pink unicorns, completely dissing the purple unicorn community.


I don’t know the specifics of the architecture we use here, but in general you can deal with different CNN input dimensions by adjusting the size of a pooling layer.
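
A toy PyTorch sketch of that idea (not the exact architecture from the course): an adaptive pooling layer squeezes whatever spatial size comes out of the conv layers down to a fixed grid, so the head always sees the same shape.

```python
import torch
import torch.nn as nn

net = nn.Sequential(
    nn.Conv2d(3, 16, kernel_size=3, padding=1),
    nn.ReLU(),
    nn.AdaptiveAvgPool2d(1),  # output is (batch, 16, 1, 1) for any H x W
    nn.Flatten(),
    nn.Linear(16, 10),
)

for size in (224, 299):
    x = torch.randn(2, 3, size, size)
    print(size, net(x).shape)  # torch.Size([2, 10]) in both cases
```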


You usually don’t use your validation set for training.

Is the dog breed notebook available anywhere?


What strategy do you recommend if the input image size is very small (e.g. 10x10 or similar) or it has more channels (RGB + additional)? To my understanding, the pre-trained models (e.g. resnet34) can’t be used in these two cases, right? Is building from scratch the only option?


Oh, OK, thanks. I thought Jeremy mentioned training with the whole dataset at the end for the dog breed competition.

IMO: what matters when setting dimensions is the final FC layers. The CNN layers can adapt to different sizes, but when you connect to an FC layer it will cause an error. However, if you use a fully convolutional network (no dense / fully connected layers) then it should work.
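
To illustrate with a made-up model why the FC layer is the problem: the Linear layer’s in_features is tied to one particular input size, so feeding a different size breaks it.

```python
import torch
import torch.nn as nn

# Hypothetical model whose FC layer is sized for 224x224 inputs only.
model = nn.Sequential(
    nn.Conv2d(3, 8, kernel_size=3, padding=1),
    nn.MaxPool2d(2),               # 224 -> 112
    nn.Flatten(),
    nn.Linear(8 * 112 * 112, 10),  # valid only when the input is 224x224
)

model(torch.randn(1, 3, 224, 224))      # works
try:
    model(torch.randn(1, 3, 299, 299))  # shape mismatch at the Linear layer
except RuntimeError as e:
    print("FC layer breaks on a different input size:", e)
```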


Are you going to be sharing the Dogs notebook from the lecture as well?

Yeah but doesn’t the pooling layer basically downsample to match the pretrained model’s input layer?

If I’m understanding what he did, it was to crop a larger image to 299x299 instead of 224x224. The adaptive pooling layer then brings the feature maps back down to the fixed size the rest of the network expects.

What’s the difference between precompute=True and freezing layers?

@nafizh He did. He recommends using the whole training set at the end, but only once you’ve already confirmed with a validation set that your training strategy (learning rate, schedule, …) works and you’re not severely overfitting.


@yinterian @jeremy Could you please cover how to save interim weights? Let’s say I wanted to average the params at specific points based on some conditions.

learn.save
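
A quick sketch of how that looks in fastai 0.7 (the checkpoint name here is made up; the weights are written under the data path’s models/ directory):

```python
learn.fit(1e-2, 2, cycle_len=1)
learn.save('interim_weights')   # saved as models/interim_weights.h5

# ...later, restore those weights and continue training
learn.load('interim_weights')
learn.fit(1e-3, 1, cycle_len=1)
```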

Thanks @zpnc. That makes things clear for me.

@yinterian Can that be done while training is running? Doesn’t the optimization need to be paused before the params are saved?