Yup. All installed.
Yes, with
from fastcore.utils import *
It's in the utils from fastbook.utils
, not fastcore
How to save a model for further training later on?
I am halfway through Lesson 7 but I have not yet found an example of how to save a model that I am partway through training. I would like to be able to load it later and continue the process.
This is what I tried:
I am, however, not sure what the filename should look like when saving, or what parameters load_model expects (i.e. if I am loading the model in a new session I no longer have the learner or the optimizer…).
Could somebody help me out with an example? Thanks a lot
Thanks, Zachary. So I did this:
# install the utils.py from fastbook
%cd '/content/drive/My Drive/fastbook/'
pip install utils
%cd ..
But I still get the NameErrors for those two lines.
You don't install, simply import (as it's just a .py file you already have in the system!)
Did anyone else get this result when using weight decay, or is it just my luck? It got worse. Can I lower the wd below 0.1, or would that cause bias in the data?
Thanks @muellerzr Zachary,
so this time:
# install the utils.py from fastbook
%cd '/content/drive/My Drive/fastbook/'
import utils
%cd ..
But no cigar; same errors.
from utils import *
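For reference, a minimal sketch of the full working cell, assuming the fastbook repo lives at '/content/drive/My Drive/fastbook/' on Colab:
# change into the fastbook directory so utils.py is importable,
# then import its contents directly (no pip install involved)
%cd '/content/drive/My Drive/fastbook/'
from utils import *
%cd ..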
Third time's the charm! Thanks for your patience, Zachary!
My pleasure, glad we got it working!
Hi @mgloria, which notebook are you in?
This is my own code. I am training a model that takes a while (>15 min/epoch) and I would like to know how to save it for further training later. I could not find an example in the notebooks for save_model and load_model.
Do you @muellerzr maybe know?
Hi @SMEissa! If we focus on the second case (wd=0.1), by epoch 5 both your training and validation losses are still getting better… so try training a bit longer, until your training loss keeps improving but your validation loss starts getting worse. That is the time to stop.
Weight decay has a regularization effect that helps prevent overfitting (which is a good thing), but it also means that it can take longer for your model to learn. That's why in your second case more than 5 epochs may be required.
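On lowering wd: you can pass a smaller value directly to the fit call. A minimal sketch, assuming a fastai Learner named learn (0.01 is just an illustration, not a recommendation):
# weaker weight-decay penalty than wd=0.1: less regularization,
# so watch for the point where the validation loss starts rising
learn.fit_one_cycle(5, wd=0.01)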
I think you have it right. You can choose any file name you want for the model. But then when you use load_model you have to pass it the filename. So in your example, you can retrieve the saved model with
my_model_objects = load_model('my_model.pth')
then you can check if you got everything you saved with
dir(my_model_objects)
You should see the model and the optimizer that you saved.
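For what it's worth, a minimal sketch of the save-and-resume round trip, assuming fastai v2's learn.save / learn.load (which store the optimizer state by default, so training can pick up where it left off):
# partway through training: save weights plus optimizer state
learn.fit_one_cycle(5)
learn.save('halfway')          # writes learn.path/models/halfway.pth

# in a later session: rebuild the same Learner first (same dls, same architecture),
# then restore the saved state and keep training
learn = learn.load('halfway')
learn.fit_one_cycle(5)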
You could also utilize the SaveModelCallback, which has a parameter for a filename that it will save to (I believe you can also have it simply save every iteration). Then do a learn.load (or load_model) to bring it back in.
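A minimal sketch of that route, assuming fastai v2's SaveModelCallback (check the docs for your version for the exact parameter names):
# assumes an existing fastai Learner called learn
from fastai.callback.tracker import SaveModelCallback

# track validation loss and keep the best weights in learn.path/models/best.pth
learn.fit_one_cycle(10, cbs=SaveModelCallback(monitor='valid_loss', fname='best'))

# later: rebuild the Learner the same way, then restore the saved weights
learn = learn.load('best')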
Thanks a lot for the clarification!
In the video, and also in the relevant fastbook notebook here, for weight decay it says that:
loss_with_wd = loss + wd * (parameters**2).sum()
which, taking the derivative, is equivalent to (note: 'parameters' above has been swapped for 'weight' below):
weight.grad += wd * 2 * weight
Shouldn't it be loss.grad instead? i.e. (I'll use the original naming of 'parameters' here):
loss.grad += wd * 2 * parameters
Or have I misunderstood something?
Thanks.
Yijin
Great question, Yijin! In principle you are correct. But PyTorch uses a slick notation trick: weight.grad implicitly holds the derivative of the loss function with respect to weight.
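If it helps, here is a small numeric check of that equivalence in plain PyTorch (a hypothetical toy example, not code from the book): adding wd * 2 * weight to weight.grad produces the same gradient as differentiating loss + wd * (weight**2).sum().
import torch

wd = 0.1
x = torch.tensor([0.5, 1.0, 1.5])
weight = torch.tensor([1.0, -2.0, 3.0], requires_grad=True)

# route 1: put the penalty into the loss and let autograd differentiate it
loss_with_wd = (weight * x).sum() + wd * (weight**2).sum()
loss_with_wd.backward()
grad_via_loss = weight.grad.clone()

# route 2: differentiate the plain loss, then add the penalty's gradient by hand
weight.grad = None
loss = (weight * x).sum()
loss.backward()
weight.grad += wd * 2 * weight.detach()

print(torch.allclose(grad_via_loss, weight.grad))  # True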
Ah right. Thanks for your clarification : )
Yijin