So at the end of the pascal notebook we have this:
learn.fit(lr, 1, cycle_len=3, use_clr=(32,5))
Looking at the code for
learner.py I found that when specifying use_clr we end up using a CircularLR schedule instead of a CosAnneal. But even after looking at the CircularLR code I’m having trouble identifying what the parameters
clr_div, cut_div means, and why is better to use a Circular anneal instead of a Cosine in this case?