Would this method be improved by shuffling the training or validation sets when creating the minibatches? (Or maybe using the previously demonstrated integer indexing to index random rows)? Or is it on the practitioner to decide whether that’s appropriate & do it as part of preprocessing?
That’s the next step
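In the meantime, here is a minimal sketch of what that shuffling could look like using integer indexing with `torch.randperm` (the function and dataset names here are just illustrative, not the course notebook’s actual code):

```python
import torch

def shuffled_minibatches(x, y, bs):
    # Shuffle the row indices once per epoch, then slice them into batches.
    idx = torch.randperm(x.shape[0])
    for i in range(0, x.shape[0], bs):
        batch_idx = idx[i:i + bs]
        # Integer indexing pulls out the corresponding random rows.
        yield x[batch_idx], y[batch_idx]

# Toy usage:
x_train, y_train = torch.randn(100, 20), torch.randint(0, 2, (100,))
for xb, yb in shuffled_minibatches(x_train, y_train, bs=16):
    pass  # train on xb, yb
```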
Yes, sorry, I think I found it: https://docs.fast.ai/torch_core.html#Functions-to-deal-with-model-initialization
it’s more of a lazily evaluated sequence than a graph
The fastai library is written on top of PyTorch, for reasons outlined in these posts:
https://www.fast.ai/2017/09/08/introducing-pytorch-for-fastai/
https://www.fast.ai/2018/10/02/fastai-ai/
We will be covering fastai for Swift for TensorFlow (S4TF) in the last 2 sessions of this course:
https://www.fast.ai/2019/03/06/fastai-swift/
This short chapter explains yield pretty well:
https://book.pythontips.com/en/latest/coroutines.html
That, and look in the layers module, but we usually override PyTorch init when it’s faulty.
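For a rough idea of what overriding a default init looks like in plain PyTorch (this is just an illustration, not fastai’s actual layers code):

```python
import torch.nn as nn

def init_linear(layer):
    # Replace the default Linear init with Kaiming init
    # when you think the default isn't appropriate.
    if isinstance(layer, nn.Linear):
        nn.init.kaiming_normal_(layer.weight)
        if layer.bias is not None:
            nn.init.zeros_(layer.bias)

model = nn.Sequential(nn.Linear(10, 50), nn.ReLU(), nn.Linear(50, 1))
model.apply(init_linear)  # applies the function to every submodule
```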
Could someone explain coroutines in a few sentences? I tried to read some related posts before but could not understand them clearly.
it’s essentially a set of instructions for how to recreate some sort of iterable, correct?
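Pretty much. A tiny illustration: a function with `yield` doesn’t run when you call it; it hands back a generator that produces values lazily, one per `next()`:

```python
def count_up_to(n):
    i = 0
    while i < n:
        yield i   # pause here and hand back i
        i += 1    # resume from this point on the next request

gen = count_up_to(3)   # nothing has executed yet
print(next(gen))       # 0 - runs until the first yield
print(next(gen))       # 1
print(list(gen))       # [2] - exhausts the rest
```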
Why do we always do zero_grad?
This has been answered earlier in this thread, look for it.
lazily executed instruction set
I found a quick tutorial for yield: the-python-yield-keyword-explained
Are there times when you wouldn’t want to zero out the gradients? And why do we have to do those two steps separately?
opt.step and opt.zero_grad are the two lines I’m curious about
Otherwise the gradients will keep accumulating across future iterations.
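To make that concrete, here is a generic PyTorch training loop sketch (not fastai’s actual code): `loss.backward()` adds into whatever is already stored in each parameter’s `.grad`, so we clear it every iteration:

```python
import torch
import torch.nn as nn

model = nn.Linear(10, 1)
opt = torch.optim.SGD(model.parameters(), lr=0.1)
loss_fn = nn.MSELoss()

xb, yb = torch.randn(16, 10), torch.randn(16, 1)

for _ in range(5):
    loss = loss_fn(model(xb), yb)
    loss.backward()   # adds this batch's gradients into param.grad
    opt.step()        # update weights using the current gradients
    opt.zero_grad()   # reset .grad, otherwise the next backward() adds on top
```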
This one could also be helpful. It doesn’t cover all possible cases, but it can probably give some additional insight.
Rachel posted a link up there in the thread that explains the PyTorch team’s design choice, look for it, it’s really interesting.
Yes, sorry, I didn’t formulate the question properly. I meant whether any attention was paid to some integration with TensorFlow as well - not full integration, but some way to add value there too. But I take it this wasn’t considered (I tried to integrate fastai with TensorFlow myself last month, but indeed didn’t find any natural integration points).
:) Yes, it will accumulate if we don’t clear it, but shouldn’t clearing just be the default, instead of having to do it explicitly?
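One commonly cited reason it isn’t the default: sometimes you want the accumulation, e.g. to simulate a larger batch size by only stepping and zeroing every few mini-batches. A rough sketch of that idea (illustrative names, not any library’s API):

```python
import torch
import torch.nn as nn

model = nn.Linear(10, 1)
opt = torch.optim.SGD(model.parameters(), lr=0.1)
loss_fn = nn.MSELoss()
data = [(torch.randn(16, 10), torch.randn(16, 1)) for _ in range(8)]

accum_steps = 4  # effective batch = 4 mini-batches

for i, (xb, yb) in enumerate(data):
    loss = loss_fn(model(xb), yb) / accum_steps  # scale so the sum averages out
    loss.backward()                              # .grad keeps accumulating
    if (i + 1) % accum_steps == 0:
        opt.step()
        opt.zero_grad()  # only clear once the "big" batch is done
```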
How does this ensure non-replacement? Is it because of yield - i.e. it “uses” the indices as it iterates along, and doesn’t reshuffle them every time?
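Right - roughly, a sampler like that shuffles the indices once and then just walks through that fixed permutation, so each index comes out exactly once per epoch. A sketch of the idea (not the actual fastai/PyTorch source):

```python
import torch

def random_sampler(n):
    # One shuffle per epoch; then walk through the permutation.
    for idx in torch.randperm(n):
        # Each index is yielded exactly once, so there's no replacement.
        yield idx

print(list(random_sampler(5)))  # e.g. [tensor(3), tensor(0), tensor(4), tensor(1), tensor(2)]
```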