Improve model for confused classes

ste · April 5, 2019, 5:06pm

Luckily I’ve got some interesting result using a kind of “Curriculum Learning”, feeding the model with high frequency classes first and low frequency later (using only frequency is very naive approach as @aamir7117 told me a lot of times ).
Focusing on samples where the model is not “confused” is way better and actually pretty simple to implement if you feed your own classes in the labelling step.

Train first with all data and measure the performance.
Pretend your model is good on classes A.B but confused on C,D,E. Then:

Create classes list

allClasses = ['A','B','C','D','E']

Create databunch for high frequency classes

dataAB = (ImageList.from_df(dataAB, path, cols=['Image'])
        .split_from_df('is_valid')
        .label_from_df('LBL', classes=allClasses) # THIS IS THE IMPORTANT STEP!
        .transform(tfms)
        .databunch(bs=64))

Create the learner with dataAB

learn = cnn_learner(dataAB, models.resnet50)

Train the model (should have higher accouracy)
Create databunch for all classes (dataAll)
Change the databunch in the learner

learn.data = dataAll

Train with all classes

CURRICULUM LEARNING