(This is a wiki post - please edit!)

## Errata

- The layer and instance norm code in the video uses `std` instead of `var`. This is fixed in the notebook.
- I said `binomial` when I meant `binary`. This is also shown incorrectly in the XL spreadsheet (now fixed).
- The variance of a batch of one calculates to 0, not infinity (with some technical exceptions). Therefore BatchNorm would attempt to scale the filter to infinity.
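To make the first and last errata concrete, here is a minimal NumPy sketch (not the course code; the variable names are mine) of the `std`/`var` fix and of why a batch of one breaks BatchNorm:

```python
import numpy as np

x = np.random.randn(8, 32)  # a toy batch of activations
eps = 1e-5
mean = x.mean(axis=1, keepdims=True)

# Buggy version (as in the video): `std` used where `var` was meant,
# so the code effectively divides by sqrt(std) rather than sqrt(var).
std = x.std(axis=1, keepdims=True)
buggy = (x - mean) / np.sqrt(std + eps)

# Fixed version (as in the notebook): divide by sqrt(var + eps),
# i.e. by (a slightly smoothed) standard deviation.
var = x.var(axis=1, keepdims=True)
fixed = (x - mean) / np.sqrt(var + eps)

# Batch of one: the variance is 0, so the scale 1/sqrt(0 + eps) blows up,
# which is why BatchNorm would try to scale the filter to infinity.
single = np.array([[3.0]])
print(single.var(axis=0))  # [0.]
```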

## Lesson resources

## Papers mentioned this week

- Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
- Layer Normalization
- Instance Normalization: The Missing Ingredient for Fast Stylization
- Group Normalization
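The four papers above differ mainly in *which axes* the mean and variance are computed over. A hedged NumPy sketch of that difference, assuming NCHW tensors (this is my own summary, not code from the papers):

```python
import numpy as np

def normalize(x, axes, eps=1e-5):
    # Subtract the mean and divide by sqrt(var + eps) over the given axes.
    mean = x.mean(axis=axes, keepdims=True)
    var = x.var(axis=axes, keepdims=True)
    return (x - mean) / np.sqrt(var + eps)

x = np.random.randn(8, 4, 6, 6)  # (batch N, channels C, height H, width W)

batch_norm    = normalize(x, (0, 2, 3))  # per channel, across the whole batch
layer_norm    = normalize(x, (1, 2, 3))  # per sample, across all channels
instance_norm = normalize(x, (2, 3))     # per sample and per channel

# Group norm: split C into groups, then normalize within each group per sample.
g = 2
group_norm = normalize(x.reshape(8, g, 4 // g, 6, 6), (2, 3, 4)).reshape(x.shape)
```

(The real layers also learn a per-channel scale and shift, omitted here for brevity.)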

## Notes and other resources

- Annotated notebooks for Lessons 8 - 12
- Lesson 10 Notes, by @Lankinen
- New learning rate schedule based on the beta probability distribution function!
- How Convolutions Work - A Mini-Tutorial
- Interpreting the colorful histograms used in this lesson
- Lesson notebooks annotated with time point links to the lesson video on YouTube

## Other relevant papers

## Papers for next week

- All you need is a good init
- mixup: Beyond Empirical Risk Minimization
- Rethinking the Inception Architecture for Computer Vision (label smoothing is in part 7)
- Adam: A Method for Stochastic Optimization
- Decoupled Weight Decay Regularization (AdamW)
- Bag of Tricks for Image Classification with Convolutional Neural Networks