Why is lr_find loss vs learning look so smooth

kechan · December 20, 2018, 4:29am

Got a Q about lr_find() as discussed in DL1. The plot of loss vs. the learning rate look very smooth compared to what I have. Is this due to batch_size? I used 32. Anyone know what they used in DL1?

eraldoluis · November 26, 2019, 7:12pm

I found this great article by Sylvain Gugger that explain this and more in details.