I’m trying to train a Faster R-CNN to detect and crop products in ad catalogues (like the image below). The products are small, and although I’ve tried decreasing
anchor_box_scales, the output still does not look good. I wonder if I just need to label more data. It’s only a binary (product vs. background) detection task, so I was hoping 20 or so images would be enough. Does anyone have an idea of how much data is needed?
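For context, here is a rough sketch of how the anchor scales could be checked against the labelled boxes. It is only an illustration under assumptions: the helper name `suggest_anchor_scales`, the (x1, y1, x2, y2) pixel box format, and the 600 px shorter-side resize are hypothetical and not taken from the implementation I'm using.

```python
import math


def suggest_anchor_scales(boxes, image_size, resized_min_side=600, n_scales=3):
    """Suggest anchor scales (in resized-image pixels) from labelled boxes.

    boxes: list of (x1, y1, x2, y2) tuples in original image pixels (assumed format).
    image_size: (width, height) of the original image.
    resized_min_side: assumed shorter-side length after the network's resize step.
    """
    width, height = image_size
    # Scale factor applied when the shorter side is resized to resized_min_side.
    ratio = resized_min_side / min(width, height)

    # Represent each box by the side of a square with the same area, after resizing.
    sides = sorted(
        math.sqrt((x2 - x1) * (y2 - y1)) * ratio for x1, y1, x2, y2 in boxes
    )

    # Pick evenly spaced quantiles of the box sizes as candidate scales.
    scales = []
    for i in range(n_scales):
        q = (i + 0.5) / n_scales
        idx = min(len(sides) - 1, int(q * len(sides)))
        scales.append(round(sides[idx]))
    return scales


if __name__ == "__main__":
    # Made-up boxes purely for illustration.
    example_boxes = [(0, 0, 10, 10), (0, 0, 20, 20), (0, 0, 40, 40)]
    print(suggest_anchor_scales(example_boxes, image_size=(1200, 1600)))
```

If the suggested values come out far below the configured anchor_box_scales, the anchors may simply be too large for the products, which would be a separate problem from having too little data.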
I am using this Keras implementation: