Lesson 10 Discussion & Wiki (2019)

Ah you caught me! :open_mouth: No we didn’t.

But… we did show that conv is just a matrix multiply, with some tied weights and zeros, and we’ve already done that from scratch; so I figured we don’t gain much doing conv from scratch too. And it would be soooooo slooooow.

But for folks still feeling a little unsure about what a conv does - you absolutely should write it yourself! :smiley:
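If you do, here's one possible shape for it - a minimal single-channel sketch with no padding or stride (my code, not the course notebooks), checked against PyTorch, plus the "it's just a matrix multiply" view via `unfold`:

```python
import torch
import torch.nn.functional as F

def conv2d_naive(x, w):
    # Single-channel 2D "conv" (really cross-correlation), no padding, stride 1.
    kh, kw = w.shape
    oh, ow = x.shape[0] - kh + 1, x.shape[1] - kw + 1
    out = torch.zeros(oh, ow)
    for i in range(oh):
        for j in range(ow):
            out[i, j] = (x[i:i+kh, j:j+kw] * w).sum()
    return out

x, w = torch.randn(5, 5), torch.randn(3, 3)
res = conv2d_naive(x, w)

# The same thing as one matrix multiply: unfold the input into patch columns,
# then every patch is multiplied by the same kernel (the "tied weights"; the
# zeros appear if you instead unroll the kernel into a big sparse matrix).
patches = F.unfold(x[None, None], kernel_size=3)        # (1, 9, 9)
res_mm = (w.view(1, -1) @ patches).view(3, 3)

assert torch.allclose(res, res_mm, atol=1e-5)
assert torch.allclose(res, F.conv2d(x[None, None], w[None, None])[0, 0], atol=1e-5)
```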

2 Likes

It’s fine to have a negative class for a binary problem (NLP, vision, or anything else), since that’s simply a sigmoid activation and we don’t have this same issue.

But we don’t have a negative class for multi-class NLP problems IIRC…

1 Like

Yes, if you know you have one and exactly one class represented in each data item, then softmax is best, since you’re helping the model by giving it one less thing to learn.
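To make that concrete, a tiny illustration with made-up logits:

```python
import torch

logits = torch.tensor([2.0, -1.0])     # scores for two classes

# softmax: probabilities sum to 1, so the model is forced to pick exactly one class
print(logits.softmax(dim=0))           # tensor([0.9526, 0.0474])

# independent sigmoids: each class gets its own yes/no, so "neither" or "both"
# are representable -- what you want for multi-label (or binary with one logit)
print(torch.sigmoid(logits))           # tensor([0.8808, 0.2689])
```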

nano doesn’t really do enough to be useful. I wouldn’t suggest spending time learning it. Use vim or emacs. Emacs is a little easier to get started with, although vim is better for manipulating datasets (there are emacs extensions to help there, though).

1 Like

Yes that’s what I was using. It’s pretty basic but it’s ok.

It’s negligible. But you can check for yourself - use %timeit to see how long an if statement takes in python. Then compare that to the number of batches we do to train a model, and see what you think.
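For example, in a notebook cell (timings vary by machine, but a branch like this is on the order of tens of nanoseconds):

```python
# In an IPython/Jupyter cell -- time a trivial branch, then weigh it against
# the number of batches in a training run:
x = 1
%timeit if x > 0: pass
```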

1 Like

I’ll make a post on that tomorrow.

2 Likes

Other resources to help with understanding convolutions and building them from scratch:

3 Likes

As Jeremy said in lesson 8, it’s supposed to keep us busy until the next course.

1 Like

Is there a specific reason why we continue using standard deviation in our convnet model, after Jeremy explains that mean absolute deviation is often better? Or does this statement not apply at all to Batchnorm etc. somehow? (Maybe that is one of those “try blah and see” experiments? :wink: )

Because that’s what everyone has always done, so I made something I knew would work to show in class. :slight_smile: It would be interesting to try abs instead. My guess is it would work about equally well. Let me know what you find if you try it!
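For anyone who wants to run that experiment, a minimal sketch of the two normalizations side by side (function names are mine, not from the notebooks; swapping the abs version into the conv net's normalization layers would be the real test):

```python
import torch

def norm_std(x, eps=1e-5):
    return (x - x.mean()) / (x.std() + eps)

def norm_mad(x, eps=1e-5):
    # normalize by mean absolute deviation instead of standard deviation
    mad = (x - x.mean()).abs().mean()
    return (x - x.mean()) / (mad + eps)

x = torch.randn(64, 32) * 3 + 1
print(norm_std(x).std())   # ~1.0
print(norm_mad(x).std())   # ~1.25 -- for Gaussian data, MAD is about 0.8 * std
```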

4 Likes

Why is the Runner not part of the Learner? If the intention is to keep the Learner free of code, then why not just have the Runner as a member of the Learner, like model, data, and loss? It seems weird to call runner.fit(1, learn) rather than just learn.fit(1).

Look at 09b :wink:
We thought of it later, but the Runner will ultimately be merged into the Learner.
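Roughly the shape that merge takes - a stripped-down sketch with no callbacks or validation, assuming a `data.train_dl` attribute, and not the actual notebook code:

```python
class Learner():
    def __init__(self, model, opt, loss_func, data):
        self.model, self.opt, self.loss_func, self.data = model, opt, loss_func, data

    def fit(self, epochs):
        # the old Runner.fit loop folded in, so the call site becomes
        # learn.fit(1) instead of run.fit(1, learn)
        for _ in range(epochs):
            self.model.train()
            for xb, yb in self.data.train_dl:
                loss = self.loss_func(self.model(xb), yb)
                loss.backward()
                self.opt.step()
                self.opt.zero_grad()
```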

5 Likes

and you can write a few more great blogs to make it easier for others to understand :slight_smile:

1 Like

I wasn’t clear in my question, but what if we are using a non-convnet architecture? Would there be a channel dimension then?

Just to add one more: https://www.coursera.org/learn/convolutional-neural-networks/home/week/1

Yes. :slight_smile:
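That is, the data still arrives with a channel dimension; a non-conv (fully connected) net typically just flattens it away. Illustrative shapes:

```python
import torch

xb = torch.randn(64, 1, 28, 28)   # (batch, channels, height, width) -- channel dim is still there
flat = xb.view(64, -1)            # (64, 784): what a fully connected net would consume
```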

If you don’t feel ready to take on Vim or Emacs yet, I suggest you download VS Code as an intermediate step. As Jeremy mentioned, you can hover over or right-click on, say, an object inherited by a class, and it will take you to the source code.

Note that it’s important to be in the correct Python environment for VS Code: you have to select an interpreter (see instructions here). Folders are also important. I can show you what I know so far - text me some times when you’re available.


I liked Jeremy’s mantra from lesson 10:

“activations are things we calculate
parameters are things we learn”

(corrected - thanks @Kaspar)
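In PyTorch terms (a throwaway example, not from the lesson):

```python
import torch
from torch import nn

lin = nn.Linear(4, 2)
x = torch.randn(3, 4)

# parameters: things we learn (the optimizer updates these)
print([p.shape for p in lin.parameters()])   # [torch.Size([2, 4]), torch.Size([2])]

# activations: things we calculate (they depend on the current input)
a = lin(x)
print(a.shape)                               # torch.Size([3, 2])
```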

2 Likes

We are going to learn about audio… How about Jeremy showing how to win this competition :smile:

7 Likes

It will probably be a good homework exercise after the next lesson :slight_smile:

2 Likes