Part 2 Lesson 8 wiki

(Michael) #300

I am currently watching lesson 9 and there Jeremy talks about the transformations in the “Bbox only” section in the jupyter notebook. As you can see at the transformed/rotated pictures of the woman the bounding box is not rotated and is only resized (and still aligned vertically and horizontally).

I guess to rotate the bounding box you have to adapt the class “CoordTransform”.
In addition, you have to use “tfm_y=TfmType.COORD” to transform the coordinates of the bounding box too.

However, maybe the pixel transformation can be of use with TfmType.PIXEL (see class “TfmType”, should be covered in later lessons on image segmentation).

(Ramon) #301

Hi @MicPie, Yes indeed, class "CoordTransform’ is the part of the code which is replacing the values by zero’s. That’s what I was looking for, thanks for your help. I’m trying to add a new TfmType to handle rotated boxes. Maybe in the end it’s better to use the PIXEL type, but in this way I’m getting to understand the frame more.

(Ramon) #302

Fixed it, if anyone is interested let me know :wink:

(Michael) #303

Nice work, looks very nice!

Do you also were able to train the neural network?
I would be interested if you use the coordinates of the four corners or maybe the two corners with a angle.

Best regards

(Ramon) #304

Hi @MicPie,

Not yet, but Ill continue next weekend. Ill keep you updated.
By the way, eventually I’m trying to solve the Kaggle airbus challenge (, interested in teaming up?

Kind regards,

(Thomas) #305

I am interested

(Ramon) #306

Hi @tcapelle,

Thanks for your response. Interested in the code and/or teaming up?


(David) #307

Would be interested in teaming up too!

(Ramon) #308

Great, I added you to the conversation, did you get that message?

(David) #309

…I did not…

(Thomas) #310

teaming up!
I have been playing with a kernel to detect ship presence first.


It seems there is no alias to data folder in the dl2 folder (contrary to dl1).

I just figured out how to create one on linux terminal to make the notebook work.
cd into dl2 folder, then :
“ln -s ~/data”


At the end of the pascal notebook, Jeremy unfreezes to layergroup -2. Then does lr find with the following result

From this graph, he chooses for the next iterations lr=5*10^-3

Can anyone explain to me wy he chooses such value ? (The graph doesn’t look like the usual “decrease then increase” graph…)

Thanks a lot !

(Nikhil) #314

Hi I had the same question. that, will rotating the input image, will rotate bounding box co-ordinates.
Does CoordTransform achieve this?
Does CoordTransform increases accuracy?

(Luca) #315

Pay attention to which cells are actually run. He got me too! These commands are not supposed to be run in succession, just skip the second sched.plot. I believe that is used earlier to test which values would work better when unfreezing last two layers!
No need really for a second lr_find here, or you can try it with lr_find(lrs) instead of lrs/1000.

(Leonardo Lima) #316

Hello dude, I got the same error.

This fixed open_image(str(IMG_PATH/im0_d[FILE_NAME]))


Pd: I’m using paperspace and just git pull

(Ramon) #317

CoordTransform takes to coordinates as input (top-left and bottom-right). On rotation (or other transformations), internally (in class CoordTransform):

  1. it draws a rectangle (y = CoordTransform.make_square(y0, x)) - red in attached picture:
  2. rotates this rectangle (y_tr = self.do_transform(y, True)) - green in attached picture
  3. grap the new top-left and bottom-right (return to_bb(y_tr)) - blue in attached picture

So the return is a new bounding box (not rotated) that’s larger than the original.


(Nikhil) #318

I was thinking it will return green box. Thanks for clarification.


I had the exact problem and it (finally!) got resolved by opening the Command Palette and running ‘Python:Build Workspace Symbols’.

(Raaj) #321

I understood what I was asking.