Lesson 1. Bounding box detection in fastai-1.x

ilya_bohaslauchyk · May 4, 2019, 8:59pm

I try to replicate the code from the lesson 1 of part 2 using fastai-1.0.52.
I create a databunch using ObjectItemList. In this case y are instances of ImageBBox class and the model is fed with the correct answers of the shape [-0.1658, -0.0735, 0.3601, 0.7672]. Accordingly the trained model gives predictions of the same shape. What is the correct way to transform these predictions into actual coordinates of the top left and bottom right corners in pixels and how to show an image with a predicted bounding box?

ilya_bohaslauchyk · May 9, 2019, 11:54am

Probably I should have red the docs carefully. Looks like that because ImageBBox scales the coordinates to the range (-1,1), the correct way would be (prediction + 1) * 112 for a square image with height = width = 224 pixels.

sagar1 · July 2, 2019, 1:15am

Hi @ilya_bohaslauchyk, could you share your code/notebook for the same. I am stuck on this for quite sometime now. and getting errors during databunch creation.

sidravic · April 7, 2020, 8:23am

@ilya_bohaslauchyk My apologies for responding to an old conversation. Your comment was super helpful. I’ve been stuck on this for a bit.

Could you help understand why 112 makes sense here and if there is a way to approach the single object detection more efficiently for batch predictions?

I have a fairly brute force approach at the moment using fastai v1.

Here I’m simply trying to visualize the prediction coordinates on the validation dataset.

for idx, pred in enumerate(preds):
    if idx > 20: break
    coords = (pred.data + 1)* 112
    coords = coords.tolist()
    
    category = ll.data.valid_ds.y[idx].labels[0]
    img = ll.data.valid_ds[idx][0]        

    bbox = ImageBBox.create(*img.size, [coords], labels=[0], classes=[category])    
    _display_with_bbox(img, bbox)

Thanks.