Try using U-net segmentation.
Awesome! Did you guys manage to get this working?
Also, do you have any thoughts on applying this to a new dataset and roughly how much training data you'd need to annotate?
Any update on Mask R-CNN code?
I found AffordanceNet code can do multiclass instance segmentation, but they only used VGG16.