Good afternoon. I tried to create a model to predict the coordinates of items on pictures, as seen in the lesson 3 Regression with BIWI head pose.
In the end I got the following error when I want to create my dataset in a similar way :
It’s not possible to collate samples of your dataset together in a batch.
Shapes of the inputs/targets:
[[torch.Size([3, 160, 160]), torch.Size([3, 160, 160])], [torch.Size([152, 2]), torch.Size([1, 2])]]
Each pictures can have different number of items/targets to detect (here there has 152 coordinates x/y to predict for one image vs an image with only one item). Do I am trying to do something impossible ? ^^ Or this a specific part of the doc I should look at ? (or maybe change the dimension of my tensor ? something like 152 x 1 x 2 ?).