Lesson 1 part 1 v2 custom images

priyal · February 10, 2018, 4:51pm

So I tried classifying beaches and mountains with a very small dataset (along the lines of what Nikhil B did in "Cricket or baseball? Lesson 1 with small datasets").

I used https://github.com/hardikvasa/google-images-download to download 40 images each of beaches and mountains, and set aside about 30% of them for the validation set.
I tried the LR finder with a reduced batch size (of 2) to find the optimal learning rate, but since there are so few images, I didn't get a usable plot. So by trial and error, I settled on 0.01 as the learning rate.
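
For anyone who wants to reproduce the 70/30 split, here is a minimal sketch using only the standard library. It assumes the downloaded images sit in one folder per class and that the target layout is `train/<class>/` and `valid/<class>/`; the function name, folder names and seed are made up for illustration and are not what I actually ran.

```python
import random
import shutil
from pathlib import Path


def split_train_valid(src_dir, dst_dir, valid_frac=0.3, seed=42):
    """Copy src_dir/<class>/* into dst_dir/train/<class>/ and dst_dir/valid/<class>/.

    Returns {class_name: (n_train, n_valid)}.
    """
    rng = random.Random(seed)  # fixed seed so the split is repeatable
    counts = {}
    class_dirs = sorted(p for p in Path(src_dir).iterdir() if p.is_dir())
    for class_dir in class_dirs:
        files = sorted(f for f in class_dir.iterdir() if f.is_file())
        rng.shuffle(files)
        n_valid = round(len(files) * valid_frac)
        for i, f in enumerate(files):
            # The first n_valid shuffled files go to validation, the rest to training.
            split = "valid" if i < n_valid else "train"
            target = Path(dst_dir) / split / class_dir.name
            target.mkdir(parents=True, exist_ok=True)
            shutil.copy2(f, target / f.name)
        counts[class_dir.name] = (len(files) - n_valid, n_valid)
    return counts


# Hypothetical usage (paths are placeholders):
# split_train_valid("downloads", "data/beachmountain")
```

With 40 images per class this puts 12 images of each class in validation and 28 in training.
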
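
A rough back-of-the-envelope sketch of why the batch size matters here. It assumes the LR finder raises the learning rate once per mini-batch over roughly one pass through the training set, so the number of points on the plot is bounded by the number of mini-batches; the batch sizes 64 and 8 are only there for comparison.

```python
import math

n_per_class = 40
n_classes = 2
valid_frac = 0.3

n_total = n_per_class * n_classes      # 80 images overall
n_valid = round(n_total * valid_frac)  # about 30% held out
n_train = n_total - n_valid            # what the LR finder can iterate over


def iterations_per_epoch(n_train, batch_size):
    """Number of mini-batches in one pass over the training set."""
    return math.ceil(n_train / batch_size)


for bs in (64, 8, 2):
    print(f"batch size {bs}: {iterations_per_epoch(n_train, bs)} iterations")
```

Even at batch size 2 there are only a few dozen steps, and each step sees just two images, so a noisy plot is not surprising.
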

I got 100% accuracy pretty quickly, probably because the first 40 images of either category returned by Google Search are easy to recognize, so binary classification in this setting is not too difficult.

One thing I could not understand was why the training loss was higher than the validation loss. However, both kept decreasing for several more epochs after accuracy reached 100%, indicating that the model was becoming more confident in its predictions.
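
To illustrate how loss can keep falling once accuracy is already 100%, here is a small toy example. The probabilities are invented for illustration (they are not from my run), and it assumes binary cross-entropy as the loss with a 0.5 decision threshold.

```python
import math


def binary_cross_entropy(probs, labels):
    """Mean negative log-likelihood; probs are predicted P(label == 1)."""
    total = 0.0
    for p, y in zip(probs, labels):
        total += -math.log(p if y == 1 else 1.0 - p)
    return total / len(labels)


def accuracy(probs, labels, threshold=0.5):
    """Fraction of predictions on the correct side of the threshold."""
    correct = sum((p > threshold) == (y == 1) for p, y in zip(probs, labels))
    return correct / len(labels)


labels = [1, 1, 0, 0]
early_probs = [0.70, 0.65, 0.30, 0.40]  # correct, but not very confident
late_probs = [0.99, 0.97, 0.02, 0.05]   # same decisions, much more confident

for name, probs in (("early", early_probs), ("late", late_probs)):
    print(name, accuracy(probs, labels), round(binary_cross_entropy(probs, labels), 4))
```

Both sets of predictions are 100% accurate, but the later one has a much lower loss because every probability has moved closer to 0 or 1.
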

Even the most uncertain predictions are well separated from the decision boundary.
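
One common way to pick the "most uncertain" predictions is to sort by how close the predicted probability is to 0.5. A minimal sketch of that idea, with made-up probabilities and a hypothetical helper name:

```python
def most_uncertain(probs, k=4):
    """Return indices of the k predictions whose probability is closest to 0.5."""
    order = sorted(range(len(probs)), key=lambda i: abs(probs[i] - 0.5))
    return order[:k]


# Invented predicted probabilities for one class, for illustration only.
probs = [0.98, 0.03, 0.71, 0.12, 0.99, 0.35]

for i in most_uncertain(probs, k=2):
    print(i, probs[i], "distance from 0.5:", round(abs(probs[i] - 0.5), 2))
```

If even the entries this returns are far from 0.5, the classifier is separating the two classes comfortably.
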

Data augmentation and differential learning rates (after unfreezing the model) did not have a significant impact here.