Lesson 9 in-class

@harveryslash Here we are also getting back what we had before the convolution (as much as we can, not perfectly recoverable)


I assume that if we run this solver for the deconv filter over multiple images, the error should decrease over time as we optimize it across more inputs. Is that correct?

Is it possible to reduce the size of the input?
arr_lr = bcolz.open(dpath+'trn_resized_72.bc')[:]
arr_hr = bcolz.open(dpath+'trn_resized_288.bc')[:]

I tried it to see the effect but ran into some errors.
If it's possible, how should I do it?

@Surya501 Yes!

Why deconvolutional output size must be specified:

http://stackoverflow.com/questions/39018767/deconvolution2d-layer-in-keras

Given the size of a convolution kernel and its stride, it is straightforward to compute the output shape of the convolution layer (assuming no padding, it's (input - kernel) // stride + 1), but the reverse is not true. In fact, more than one input shape can match a given output shape of the convolution layer (because integer division isn't invertible). This means that for a deconvolution layer, the output shape cannot be determined simply from the input shape (which is implicitly known), kernel size, and stride; this is why we need to specify the output shape when we initialize the layer. Of course, because of the way the deconvolution layer is defined, some input shapes would leave undefined "holes" in its output, and if we forbid those cases then we actually can deduce the output shape.
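A minimal sketch of that ambiguity, using the output-shape formula quoted above (the specific sizes are just an illustration):

```python
# Output size of a no-padding convolution: out = (input - kernel) // stride + 1
def conv_out(input_size, kernel, stride):
    return (input_size - kernel) // stride + 1

# With kernel=3 and stride=2, inputs of size 7 and 8 both produce an
# output of size 3 -- so a deconvolution cannot recover the input size
# from the output size alone.
print(conv_out(7, 3, 2))  # 3
print(conv_out(8, 3, 2))  # 3
```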


Is the 127.5 Lambda function at the end of the upsampling network there to normalize the output image?
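For context, a sketch of what that Lambda likely does, assuming the generator ends in a tanh activation (an assumption; tanh outputs lie in [-1, 1]):

```python
import numpy as np

# Assumed: the network's final activation is tanh, so outputs are in [-1, 1].
# Adding 1 and multiplying by 127.5 maps that range back to [0, 255] pixel
# values -- i.e. it de-normalizes rather than normalizes the output image.
def to_pixels(x):
    return (x + 1) * 127.5

print(to_pixels(np.array([-1.0, 0.0, 1.0])))  # [0, 127.5, 255]
```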


Why use VGG for the loss? If we do, wouldn't we get good results only for the 1000 classes in ImageNet? Can we use a pixel-to-pixel loss instead?

@thejaswi.hr pixel-to-pixel loss creates blurry outputs
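A toy illustration (an assumed 1-D example, not from the lesson) of why a pixel-wise loss blurs: when several sharp images are equally plausible targets, the prediction that minimizes expected pixel-wise MSE is their average, which smears the edge:

```python
import numpy as np

# Two equally plausible sharp targets: an edge at one of two positions.
targets = np.array([
    [0.0, 0.0, 1.0, 1.0],
    [0.0, 1.0, 1.0, 1.0],
])

# The prediction minimizing expected pixel-wise MSE is the mean target:
mse_optimal = targets.mean(axis=0)
print(mse_optimal)  # [0, 0.5, 1, 1] -- a soft, blurry edge
```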


If we’re only using up to block2_conv2, could we pop all layers afterwards and save some computation?

We don’t care about the results after that, correct?


Can you explain the size of targ? It’s the first dimension of the high res * 128? How come?

Can we somehow use the pre-computed weights?

Wait, but would popping the unused layers really save anything? I thought he was already getting only the layers he wants with vgg_content = Model(vgg_inp, vgg.get_layer('block2_conv2').output)


Intuitively, what features is this model actually learning?

I feel like, if we had some kind of VGG-like model that was trained extremely well in a particular domain of images, then in theory this could do extremely well for low res images in that domain?

Just a random idea I got.
Like the doodle regeneration we did, using weights trained on photographs of models and then optimizing. Is it possible to turn a regular image into how you would look if you were a model?!

Style transfer your face with Brad Pitt's?


Exactly, that’s what I was referring to. Something like that?!

What does it mean to have “stride 1/2” here?


@cody that is a deconvolution
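One way to picture "stride 1/2" (a fractionally-strided, i.e. transposed, convolution): insert zeros between the input values and then convolve with stride 1, so the layer upsamples instead of downsampling. A 1-D sketch (an illustrative assumption, not the lesson's code):

```python
import numpy as np

# "Stride 1/2": space the input out by interleaving zeros, then convolve
# normally. The output ends up roughly twice as large as the input.
def half_stride_expand(x):
    out = np.zeros(2 * len(x) - 1)
    out[::2] = x          # [a, b, c] -> [a, 0, b, 0, c]
    return out

expanded = half_stride_expand(np.array([1.0, 2.0, 3.0]))
print(expanded)           # [1, 0, 2, 0, 3]
# A stride-1 convolution over `expanded` now produces about twice as many
# outputs as one over the original input -- the upsampling effect.
```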

I don’t understand the stride = 1/2 either, what does that mean, really?
