Lesson 9 Discussion & Wiki (2019)

Here is what you were talking about. Thanks Sylvain.

yield keeps the state of the for loop until it’s finished.

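A toy illustration of that (the function name is made up for the example):

```python
def batch_indices(n, bs):
    # The body runs lazily: each `yield` hands back one batch of indices and
    # pauses the for loop right there until the next value is requested.
    for i in range(0, n, bs):
        yield list(range(i, min(i + bs, n)))

gen = batch_indices(10, 4)
print(next(gen))  # [0, 1, 2, 3]
print(next(gen))  # [4, 5, 6, 7] -- the loop resumed where it left off
print(next(gen))  # [8, 9]
```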

Sometimes we don’t want to reset the gradients to zero, for example in a warm start.
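As a concrete case, here is a rough sketch of gradient accumulation, where the zeroing is deliberately skipped for a few mini-batches (`model`, `loss_func`, `opt` and `dl` are placeholders, not code from the lesson):

```python
accum_steps = 4  # pretend the effective batch size is 4x the DataLoader's

for i, (xb, yb) in enumerate(dl):
    loss = loss_func(model(xb), yb) / accum_steps
    loss.backward()               # gradients keep adding up across iterations
    if (i + 1) % accum_steps == 0:
        opt.step()
        opt.zero_grad()           # only now are they reset to zero
```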

Where is `__iter__` of the DataLoader invoked?

Doesn’t the provided collate function void the benefits of iterators?

When you call `for bla in dl`.

In what sense? You need a way to collate your samples together in a batch.

Right, my point was that gradients could be cleared by default, and if you don’t want to clear them you’d say so explicitly, with something like a keep_grad argument.

We must be sampling without replacement.

Well, doesn’t it get the whole batch into memory? What if more than one row of a batch can’t fit in memory?

A link has been posted twice to answer that question. Please click on it before asking it again :wink:

I have put it up in the wiki.

You only collate the samples you want to yield right now. If your batch size is 64, that's just those 64 samples.

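A minimal sketch of that, along the lines of the from-scratch DataLoader in the lesson notebooks (the exact code there may differ slightly):

```python
import torch

def collate(samples):
    # Stack just this batch's samples into an x tensor and a y tensor.
    xs, ys = zip(*samples)
    return torch.stack(xs), torch.stack(ys)

class DataLoader():
    def __init__(self, ds, sampler, collate_fn=collate):
        self.ds, self.sampler, self.collate_fn = ds, sampler, collate_fn

    def __iter__(self):
        # One batch of indices at a time: only those bs samples are fetched
        # from the dataset and collated, never the whole dataset at once.
        for idxs in self.sampler:
            yield self.collate_fn([self.ds[i] for i in idxs])
```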

Note that num_workers > 1 could give you some problems with memory leaks. Not sure if that was fixed in the most recent PyTorch or if it’s an inherent Python issue.

A lot of times in Kaggle and Colab kernels (a very big headache).

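If you run into that, one workaround (at the cost of loading speed) is simply to lower it; this is the standard num_workers argument of the PyTorch DataLoader (`dataset` here is a placeholder):

```python
from torch.utils.data import DataLoader

# Fewer worker processes means less parallel data loading, but also fewer
# chances of hitting the multiprocessing memory issues mentioned above;
# num_workers=0 keeps everything in the main process.
dl = DataLoader(dataset, batch_size=64, num_workers=0)
```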

How do we incrementally train the model? Do we use the statistics of the original training data, i.e. its mean and std dev, and keep training it?

Thanks…
What does `with torch.no_grad()` ensure?

@champs.jaideep It ensures that you do not perform backprop on the validation set in this case…

If I can ask a bit of an off-topic question: do we anticipate more integration of generative models in fast.ai soon? Or is it that the main components are given and it’s up to us to construct the more complex models?

When you do the forward pass, you need to store intermediate results for the backward pass (as we showed in notebook 02). Using `with torch.no_grad()` removes that default behavior and lets you save GPU memory.

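In code, the typical validation pattern looks something like this (`model`, `loss_func` and `valid_dl` are placeholders):

```python
import torch

model.eval()                     # also switches dropout/batchnorm to eval mode
with torch.no_grad():            # no intermediate activations kept for backward
    tot_loss, n = 0., 0
    for xb, yb in valid_dl:
        tot_loss += loss_func(model(xb), yb).item() * len(xb)
        n += len(xb)
print(tot_loss / n)
```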