Part 2 Lesson 14 Wiki

sgugger · May 1, 2018, 2:08am

If you want to try LARS, it’s very easy to implement as an optimizer in pytorch (did it in this gist).

gerardo · May 1, 2018, 2:11am

Isn’t that what the NVIDIA demo is doing?

ramesh · May 1, 2018, 2:12am

Are we using VGG16 n the model? SrResnet seems to build a model from Scratch?

kro · May 1, 2018, 2:17am

What is a “learnable convolution” and what is an example of a convolution that isn’t learnable?

kro · May 1, 2018, 2:17am

Curious about your context: isn’t what what the NVIDIA thing is doing?

gerardo · May 1, 2018, 2:20am

kro · May 1, 2018, 2:22am

Why are we using these little 3x3 squares of every color, instead of using noise in the new pixels?

I understand why we don’t just leave them blank, and maybe why we don’t copy the nearest-neighbors. But why not noise?

kro · May 1, 2018, 2:24am

Does this mean I can replace

m = nn.DataParallel(m, [0,2])

with something, to get rid of the error below?

RuntimeError: cuda runtime error (10) : invalid device ordinal at /opt/conda/conda-bld/pytorch_1518244421288/work/torch/lib/THC/THCTensorCopy.cu:204

kro · May 1, 2018, 2:25am

Because then the sequential layers would functionally just be one layer, I think.

blakewest · May 1, 2018, 2:34am

Yeah you probably want to change the [0,2] to only contain numbers that actually correspond to GPU’s on your computer. Like, maybe [0,1]?

nchukaobah · May 1, 2018, 2:34am

Can he explain progressive resizing again? I don’t understand how to use it

kro · May 1, 2018, 2:35am

thanks … but yeah I had tried [0,0] and it didn’t help; [0,1] didn’t either.
I wonder how to find out what the correct values would be!!

Borz · May 1, 2018, 2:38am

Huh… I wonder if using load state_dict(strict=False) would work as a quick way to load weights from a pretrained model. Say: pretrained keras/tensflow retinanet, if you more/less match the architecture in pytorch.

snagpaul · May 1, 2018, 2:39am

Also, can we use progressive resizing to match the idea of backbone + head?

KevinB · May 1, 2018, 2:40am

Is that a checkerboard pattern on the bluejay?

kro · May 1, 2018, 2:43am

where?

KevinB · May 1, 2018, 2:49am

on the neck and head area of the bluejay it seemed to checkerboard.

KevinB · May 1, 2018, 3:14am

@rachel if there is a good time to fit this in, I’m curious if Jeremy has an explanation.
If not, no biggie!

fmichaelkunz · May 1, 2018, 3:16am

DONT DO A PhD. I was enrolled for two years and dropped out. More opportunities outside without a PhD.

matttrent · May 1, 2018, 3:18am

Or at the very least if you do one, take everything your professors tell you with a grain of salt (they have no idea what goes on in the real world). And do as many industry internships / work part time as you can.

I have a PhD, and worked for 2 startups during my degrees. There’s still a definitely opportunity cost to spend that many years of your life. But if you go in with the understanding that your responsible for also ensuring your industry success afterwards, you can leverage it into some interesting skills beyond your specific subject area.