Resume training with fit_one_cycle

Alexandre_DIEUL · January 31, 2020, 7:13am

Hi,

Sorry for the noob question but, in my case, a preemption of my GCP VM interrupted my training. The vm was simply shut down.
If I resume training with fit_one_cycle, passing along start_epoch with the right epoch number, I don’t think it would work as the model I would pass would have been saved before launching fit_one_cycle, or am I missing something ?

If I’m not mistaken, How can I save the model automatically at each end of epoch so I can resume training in case of a shutdown/preemption ?

Thanks a lot for any answer on that matter and thanks for the incredible work you folks are doing here.

Regards,
Alexandre.