From the lesson 7 notes …
We can extend the efficiency of this model by predicting bounding boxes and then feeding a cropped image of the fish in the bounding box to a classification model.
Would we feed the cropped images into a separate model that we create and train, and then use the output of that classification model for our final predictions/submissions?
How would we handle different sizes of cropped images since they will vary based on the bounding box?
Also, would it be possible/worthwhile to train a classifier on unrelated images of the different kinds of fish we need to identify … and then somehow use that data to find the fish in the fisheries competition images? Perhaps even be able to find multiple fish in the same image even???