Thanks Rachel. I went through the code and ran notebook 1 yesterday.
Even though I have GTX 1080 in my home system, I had issues running the notebook at that batch size of 224. I played around by increasing size from 32,64,128,160,192. I could never get the same accuracy numbers, they were always less than the original notebook. I tried playing around with learning rate by decreasing it and increasing the numbers, but the result was still the same.
I assume, the code is using batch SGD. Do you have any reading material on how does the size of batch impacts the optimization?
I think, I will play around by changing the optimization algorithms next. But as far as I read, ‘Adam’ and new algorithms maybe faster but SGD is still more accurate.
Finally, thanks to you and Jeremy for the effort you guys put in this course.