Thanks!
How come the validation loss is smaller than the training loss?
Is lesson1 using the test data at all? I don't see a cats/dogs split in the test data dir.
Why aren't we looking at validation accuracy? I thought validation accuracy and validation loss don't correspond linearly. Or was it validation error?
In Kaggle competitions you don't get the labels for the test dataset; you can only make predictions on it.
There's a recent paper called "Don't Decay the Learning Rate, Increase the Batch Size". Is adjusting the learning rate the most effective way to converge, or is adjusting the batch size effective as well?
Test data doesn't have labels; these are unlabeled images from the Kaggle competition. If you want to make a Kaggle submission, you should take these images, predict a class for each with your model, and upload the predictions to Kaggle.
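Roughly, the prediction loop looks something like this. A minimal sketch in plain PyTorch, assuming you already have a trained two-class `model` and a `test_dir` of unlabeled JPEGs; the `id`/`label` columns follow the Dogs vs. Cats Redux submission format, so check your competition's rules:

```python
import os
import torch
import pandas as pd
from PIL import Image
from torchvision import transforms

# NB: real code should apply the same normalization used during training.
preprocess = transforms.Compose([
    transforms.Resize((224, 224)),
    transforms.ToTensor(),
])

model.eval()
rows = []
with torch.no_grad():
    for fname in sorted(os.listdir(test_dir)):
        img = preprocess(Image.open(os.path.join(test_dir, fname)).convert("RGB"))
        logits = model(img.unsqueeze(0))               # shape (1, 2) for cat/dog
        prob_dog = torch.softmax(logits, dim=1)[0, 1].item()  # assumes index 1 = dog
        rows.append({"id": os.path.splitext(fname)[0], "label": prob_dog})

pd.DataFrame(rows).to_csv("submission.csv", index=False)  # upload this file to Kaggle
```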
So the labels weren't released after the fact?
As far as I know, correct labels for the test set are not available at kaggle.com.
You'd need really powerful GPUs to make the batch size large enough.
What's the atom optimizer?
Adam, he will explain it later.
Actually, it is Adam, another optimization method.
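For reference, the update rule from the Adam paper (Kingma & Ba, 2014) is simple enough to sketch in a few lines of NumPy; names like `theta`, `m`, and `v` are just illustrative, not from the lesson code:

```python
import numpy as np

def adam_step(theta, grad, m, v, t, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    m = beta1 * m + (1 - beta1) * grad        # running mean of gradients
    v = beta2 * v + (1 - beta2) * grad ** 2   # running mean of squared gradients
    m_hat = m / (1 - beta1 ** t)              # bias correction for early steps
    v_hat = v / (1 - beta2 ** t)
    theta = theta - lr * m_hat / (np.sqrt(v_hat) + eps)
    return theta, m, v

# One step on a dummy 3-element parameter vector:
theta, m, v = np.zeros(3), np.zeros(3), np.zeros(3)
theta, m, v = adam_step(theta, np.array([0.1, -0.2, 0.3]), m, v, t=1)
```

The per-parameter scaling by `sqrt(v_hat)` is what makes it less sensitive to the learning rate than plain SGD.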
Does this cyclical learning rate method work for all ML architectures, or is it meant for this sort of classification problem?
Does the learning rate finder work with all types of NNs or just CNNs?
For all.
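The finder doesn't depend on the architecture at all, which is why it works for any network. Here's a rough PyTorch sketch of the LR range test idea from Smith's cyclical learning rates paper, assuming an existing `model`, `train_loader`, and `criterion` on the same device; the actual fastai implementation differs in the details:

```python
import torch

def lr_range_test(model, train_loader, criterion,
                  lr_start=1e-7, lr_end=10, num_steps=100):
    optimizer = torch.optim.SGD(model.parameters(), lr=lr_start)
    mult = (lr_end / lr_start) ** (1 / num_steps)   # LR multiplier per step
    lrs, losses, lr = [], [], lr_start
    for step, (x, y) in enumerate(train_loader):
        if step >= num_steps:
            break
        optimizer.param_groups[0]["lr"] = lr
        optimizer.zero_grad()
        loss = criterion(model(x), y)
        loss.backward()
        optimizer.step()
        lrs.append(lr)
        losses.append(loss.item())
        if losses[-1] > 4 * min(losses):            # stop once the loss blows up
            break
        lr *= mult
    return lrs, losses
```

You then plot `losses` against `lrs` on a log x-axis and pick a rate where the loss is still falling steeply.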
I didn't get the reason for not picking the minimum of the learning rate vs. loss curve, but a slightly higher rate instead.