Lesson 2 In-Class Discussion ✅

AlexisGallagher · October 31, 2018, 2:09am

For me, it seems to require CMD-OPTION-C on macOS (Safari 12.0) not CMD-OPTION-J to pull up the JavaScript Console.

saadorj · October 31, 2018, 2:10am

How to address issues with imbalanced data? aka some classes have very few photos compared to others?
Aka data augmentation that is weighted by ratios of class imbalances or something like that?
thanks

sandmann · October 31, 2018, 2:10am

When there are unsuitable images (e.g. drawings instead of photos) in the training dataset. How can I best remove them? Should I?

nbharatula · October 31, 2018, 2:10am

What’s the metrics=error_rate line for?

ladydata · October 31, 2018, 2:10am

Yes, I’d like to know how to handle images that are inherently not squared, say all of them will be very rectangular

gamino · October 31, 2018, 2:11am

When doing a lr_find(), is it actually training the model?

avinashj · October 31, 2018, 2:11am

Is the size of validation-set always 20% or does it depend upon your data size ?

lesscomfortable · October 31, 2018, 2:11am

This is going to be explained in a few minutes

ram_cse · October 31, 2018, 2:11am

It should be removed manually.

Jess · October 31, 2018, 2:11am

No, it’s trying out different LRs to help find the best via visualization.

sgugger · October 31, 2018, 2:11am

It’s a mock training with a various range of learning rates. But the original model is loaded after, so it doesn’t change the weights.

Mauro · October 31, 2018, 2:11am

looks like 3e-3 would have been better

champs.jaideep · October 31, 2018, 2:12am

what if curve is seen flat for many iterations unlike this one where it goes high in just few iterations

voliv · October 31, 2018, 2:12am

what is the y axis in the lr_find graph?

raghavanm · October 31, 2018, 2:12am

Question : Is training loss and error rate same thing computed on training data and test data ?

Mauro · October 31, 2018, 2:12am

Error rate

atchuth · October 31, 2018, 2:13am

Karpathy said validation sets should be made carefully, Rachel also has an article about it. when Is it ok to randomly split data ?

sgugger · October 31, 2018, 2:13am

Just on the training set.

marcmuc · October 31, 2018, 2:13am

Not sure, but @william has an intersting approach to curating scraped datasets. Have a look here:

lesscomfortable · October 31, 2018, 2:13am

You can send a ‘size’ parameter to ImageDataBunch that will crop and pad your images to get them to be of your desired size.