Chapter 10 Learning Finder for Text Applications

gviviano · October 8, 2021, 6:06pm

I have a number of questions:

Why can’t you rerun learning rate finder between each unfreeze?
Where do these values for LR come from in 10_nlp.ipynb and how were they picked?
learn.fit_one_cycle(1, 2e-2)

learn.freeze_to(-2)
learn.fit_one_cycle(1, slice(1e-2/(2.6**4),1e-2))

learn.unfreeze()
learn.fit_one_cycle(2, slice(1e-3/(2.6**4),1e-3))
How do we decide on the “(2.6**4)” in the slice???

Conwyn · October 8, 2021, 6:38pm

|Hi Gregory

Not a really helpful example but Jeremy said on one of the videos (I can not remember the course year) but he found 2.6 works well.

Regards Conwyn

muellerzr · October 8, 2021, 6:42pm

Correct.

Generally we do not find a learning rate in between each fit, you stick with the first finding and then adjust it by this 2.6 ** 4

You can now, and with the new lr suggesters it’s more reliable to do so, just it takes a bit longer. The 2e-2, 1e-2, 1e-3, etc is just a rule of thumb he used that works

gviviano · October 8, 2021, 8:09pm

Thanks for the quick response. I was wondering if it as rule of thumb or actually calculated.

I am running v2 and I have am having trouble running it after then first fit. Are there examples anywhere?

gviviano · October 8, 2021, 8:12pm

When I used it on my example I fit a LR and then used it all the way through. so I think I used

learn.fit_one_cycle(1, 5e-3)
learn.freeze_to(-2)
learn.fit_one_cycle(1, slice(5e-3/(2.6 ** 4),5e-3))
learn.unfreeze()
learn.fit_one_cycle(2, slice(5e-3/(2.6 ** 4),5e-3))