Detectron2 anyone?

My DataBlock is eassier, is just a dict.

I am using it just for the target and keeping ImageBlock for the input.

It works pretty straidforward. I am not using transforms like resize and normalize. It is done in first layer of torchvision.models.detection.maskrcnn_resnet50_fpn.