Part 2 Lesson 8 wiki

Thanks! Visual Studio Code is free while I think PyCharm is subscription based. Is there a free version of PyCharm?

Yes, the free “community” version is here: https://www.jetbrains.com/pycharm/download/#section=mac

Also see this thread for additional info on setup:

Oh yes, my bad! Thanks for the videos.
Looking at blocks 107 and 108:

('ReLU-107',
OrderedDict([('input_shape', [-1, 512, 7, 7]),
('output_shape', [-1, 512, 7, 7]),
('nb_params', 0)])),
('BasicBlock-108',
OrderedDict([('input_shape', [-1, 256, 14, 14]),
('output_shape', [-1, 512, 7, 7]),
('nb_params', 0)])),

the output of block 107 is 512 x 7 x 7 while the input of block 108 is 256 x 14 x 14.
Is there any operation between the two layers which could explain that? I was thinking about the residual connection, but I don’t see how it would explain it. Do you have any hint?

Thank you @jeremy & @asif.imran. I followed the 3 commands and could install fastai (CPU) on my machine. Then, I selected the interpreter in VS Code.

Eventually, I searched for #open_image but it didn’t show any results. Any thoughts?


That’s kind-of true. Although what’s “top” and “bottom” depends on your coordinate plane layout (as you noticed!). It’s very hard to write about all this in a way that’s clear and perfectly accurate! In this case the docs, when they say “bottom”, mean “lower numbered”.

Generally when we label coordinates of a cell in a matrix, or a pixel in an image, we order the y axis top to bottom. But in charting, we label it bottom to top. I’m not sure there’s a way to work around this confusion!
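
For instance, a tiny sketch with numpy and matplotlib (the array and point below are just an illustration):

import numpy as np
import matplotlib.pyplot as plt

img = np.zeros((10, 10))
img[2, 5] = 1                     # row 2, column 5 -- near the *top* of the image

fig, (ax0, ax1) = plt.subplots(1, 2)
ax0.imshow(img)                   # imshow draws row 0 at the top, so y grows downwards
ax0.set_title('image: y grows down')
ax1.scatter([5], [2])             # a chart puts y=0 at the bottom, so y grows upwards
ax1.set_ylim(0, 10)
ax1.set_title('chart: y grows up')
plt.show()

The same pair of numbers ends up near the top in one panel and near the bottom in the other.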

The important question is - have I managed to confuse myself and screw up the definition of any of the models as a result? Frankly, I’m never 100% sure… I think it’s all OK, and it seems to be working OK, but if you see anything that looks backwards please do feel free to experiment to see if you can get better results by changing things around.


Thank you very much Jeremy for your detailed explanation.

Following your suggestion, I will certainly test the model by changing the corner points, and I’ll post an update if there is a performance difference.

Slight clarification, it remembers it for the life of the shell. If you e.g. shut down your machine and start it up again, you’ll need to activate your conda environment again.

We may be discussing different things. I was referring to the selection of env/interpreter in vscode. This is remembered across sessions.


Please don’t feel bad. I am learning this myself :slight_smile:

So, I may have been too quick last time. You actually want to look at this:

('Conv2d-100',
              OrderedDict([('input_shape', [-1, 256, 14, 14]),
                           ('output_shape', [-1, 512, 7, 7]),
                           ('trainable', False),
                           ('nb_params', 1179648)])),

This corresponds to the start of the following block:

(7): Sequential(
    (0): BasicBlock(
      (conv1): Conv2d (256, 512, kernel_size=(3, 3), stride=(2, 2), padding=(1, 1), bias=False)
      (bn1): BatchNorm2d(512, eps=1e-05, momentum=0.1, affine=True)

<removed>

So, if you put all the info together:

  1. the input is 256 x 14 x 14
  2. each conv2d filter covers a 256 x 3 x 3 volume
  3. the number of output filters (channels) is 512

This gives you an output that is 512 x 7 x 7.
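
As a quick sanity check, here's a minimal sketch (assuming a recent PyTorch install; the tensor is just a dummy batch):

import torch
import torch.nn as nn

# same settings as the Conv2d-100 layer from the summary above
conv = nn.Conv2d(256, 512, kernel_size=3, stride=2, padding=1, bias=False)

x = torch.randn(1, 256, 14, 14)                   # dummy batch: 256 channels, 14x14 spatial
print(conv(x).shape)                              # torch.Size([1, 512, 7, 7])
print(sum(p.numel() for p in conv.parameters()))  # 1179648 = 512 * 256 * 3 * 3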

Best,
A


…because the stride is 2x2


While I found it slightly difficult to wrap my head around the collections.defaultdict and the pile of dictionaries used to create the final bounding-box dataset (for the largest item classifier), I’ve put up a gist on how to get this done really quickly using pandas and a couple of apply commands.
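
Roughly, the idea looks like this (a toy sketch, not the actual gist; the DataFrame and column names are assumptions):

import pandas as pd

# one row per annotation: image id, class, and a bbox as [x, y, width, height]
df = pd.DataFrame({
    'image_id':    [1, 1, 2],
    'category_id': [3, 7, 3],
    'bbox':        [[10, 10, 50, 30], [5, 5, 200, 150], [0, 0, 20, 20]],
})

# compute the area of every box with a single apply
df['area'] = df['bbox'].apply(lambda b: b[2] * b[3])

# keep only the largest annotation per image
largest = df.loc[df.groupby('image_id')['area'].idxmax()]
print(largest)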

Jeremy suggested improving the apply part (since it’s serial) and I’ll do it soon.

Ganesh from the SF weekend study group also shared this idea. Thank you.

Edit: Updated the link of the gist with the minor changes to the source code.


Yeah I like this a lot better than how I did it.


I am a mac user too and vouch for PyCharm :slight_smile:


Hmm thanks, I think I understand this part well :grinning:
But do you have any idea why the output of block 107:

('ReLU-107',
OrderedDict([('input_shape', [-1, 512, 7, 7]),
('output_shape', [-1, 512, 7, 7]),

does not match the input of block 108:

('BasicBlock-108',
OrderedDict([('input_shape', [-1, 256, 14, 14]),
('output_shape', [-1, 512, 7, 7]),

How can this “work” without any in-between operation?

Install ctags…

  1. Download it from http://ctags.sourceforge.net/
  2. Unzip it into any folder
  3. Add that folder to your PATH environment variable
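
Once that's done, a quick sanity check from Python (shutil.which just looks the executable up on your PATH):

import shutil

print(shutil.which('ctags'))   # prints the full path if ctags is on your PATH, otherwise None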

Should we change from ['bbox'] to ['bbox_new'] as below? Otherwise, the bbox_str won’t capture the new orientations.

largest_bbox['bbox_str'] = largest_bbox['bbox_new'].apply(lambda x: ' '.join(str(y) for y in x))
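
Just to illustrate what that line produces (toy values):

import pandas as pd

largest_bbox = pd.DataFrame({'bbox_new': [[96, 155, 269, 350]]})   # toy row
largest_bbox['bbox_str'] = largest_bbox['bbox_new'].apply(lambda x: ' '.join(str(y) for y in x))
print(largest_bbox['bbox_str'][0])   # '96 155 269 350'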


Oh yes. You’re right. I’ll update the gist in a while.

Thank you.

Edit: Done.


The argument to the open_image function has to be cast to a string, so use it like this:

open_image(str(IMG_PATH/trn_fns[i]))
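
For context, IMG_PATH in the notebook is a pathlib Path, so IMG_PATH/trn_fns[i] is a Path object rather than a plain string (a tiny sketch; the path and filename below are made up):

from pathlib import Path

IMG_PATH = Path('data/pascal/JPEGImages')   # hypothetical path
fname = IMG_PATH / '000012.jpg'             # '/' on a Path builds another Path, not a str

print(type(fname))   # <class 'pathlib.PosixPath'> (or WindowsPath)
print(str(fname))    # 'data/pascal/JPEGImages/000012.jpg' -- the plain string form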

I have an error in F.softmax(predict_batch(learn.model, x), -1):

x,y=next(iter(md.val_dl))
probs=F.softmax(predict_batch(learn.model,x),-1)
x,preds=to_np(x),to_np(probs)
preds=np.argmax(preds,-1)

NameError: name 'predict_batch' is not defined

I have searched for predict_batch in the whole notebook but it isn’t there. Maybe it was defined in a past lecture. Please help me.
Thanks in advance.
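
If it helps while you track down the original helper, here's a minimal stand-in you could define yourself (a sketch assuming learn.model is a plain PyTorch module; this is not the library's own definition):

import torch

def predict_batch(model, x):
    # run one batch through the model in eval mode, without tracking gradients
    # (hypothetical stand-in for the missing helper, not fastai's definition)
    model.eval()
    with torch.no_grad():
        return model(x)

# usage, matching the snippet above:
# x, y = next(iter(md.val_dl))
# probs = F.softmax(predict_batch(learn.model, x), -1)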