Was this an issue in fastai v1?

ilovescience · October 23, 2019, 1:09am

Is this an issue that would have affected fastai v1 in the past?

sgugger · October 23, 2019, 1:43pm

v1 was using the Adam optimizer and doing decoupled weight decay on its own for AdamW, so the issue would be there.
I wouldn’t go as far as saying you need to retrain all the models though.
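
For anyone unsure what "decoupled weight decay on top of Adam" means, here is a minimal sketch in plain Python for a single scalar parameter. It is only an illustration, not the actual fastai v1 code: the function names (`adam_step`, `adamw_step`, `adam_l2_step`) and the default hyperparameters are assumptions made up for this example. The point is that the AdamW variant just shrinks the weight and then calls the same plain Adam step, so anything that affects the underlying Adam update carries over.

```python
import math

def adam_step(p, g, m, v, t, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """One plain Adam update for a scalar parameter (no weight decay)."""
    m = beta1 * m + (1 - beta1) * g          # running average of gradients
    v = beta2 * v + (1 - beta2) * g * g      # running average of squared gradients
    m_hat = m / (1 - beta1 ** t)             # bias correction, t starts at 1
    v_hat = v / (1 - beta2 ** t)
    p = p - lr * m_hat / (math.sqrt(v_hat) + eps)
    return p, m, v

def adamw_step(p, g, m, v, t, lr=1e-3, wd=1e-2, **kw):
    """Decoupled weight decay: shrink the weight directly, then run plain Adam."""
    p = p * (1 - lr * wd)
    return adam_step(p, g, m, v, t, lr=lr, **kw)

def adam_l2_step(p, g, m, v, t, lr=1e-3, wd=1e-2, **kw):
    """Classic L2 regularization: add wd * p to the gradient before Adam."""
    return adam_step(p, g + wd * p, m, v, t, lr=lr, **kw)

# With a zero gradient, only the weight decay term moves the parameter.
p_decoupled, _, _ = adamw_step(1.0, 0.0, 0.0, 0.0, t=1)
p_l2, _, _ = adam_l2_step(1.0, 0.0, 0.0, 0.0, t=1)
print(p_decoupled, p_l2)
```

With a zero gradient the decoupled version only shrinks the weight by a factor of `1 - lr * wd`, while the L2 version pushes the decay term through Adam's normalization and takes a step of roughly `lr`, which is why the two are not interchangeable.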

ilovescience · October 23, 2019, 6:03pm

Ok sounds good.

Yes, retraining all our models is probably overkill, but this may matter more when trying to replicate a paper, for example.