Advice for image classification when classes are very, very similar


(WG) #1

For example, imagine your task is to differentiate between American Labrador Retrievers and English Labrador Retrievers.

What can we do to improve the performance of our classifier given that the classes are very, very similar? I’ve tried running through a dataset like this and can’t get the classifier to do better than a coin flip.


(Jeremy Howard) #2

I don’t think there are any special approaches in this case. If you can provide more details about the dataset you’re using, and what you’ve tried, we can try to give specific recommendations.


(WG) #3

This may sound kind of lame, but I’m trying to build a classifier that can distinguish between In-N-Out burgers and other burgers.

I have about 400 pictures of In-N-Out burgers and almost 2,000 images of other burgers. Most of the images are pretty big.

So far I’ve basically tried the lesson 1 approach on the dataset. I’m only getting about 0.18 accuracy and my validation loss is NaN (which can’t be good).
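
In case it helps, this is roughly the shape of what I’m running (a lesson 1 style sketch using the old fastai conv_learner API; the path and the fit arguments below are placeholders rather than my exact values):

```python
from fastai.conv_learner import *

PATH = 'data/burgers/'   # placeholder path: expects train/ and valid/ folders, one subfolder per class
arch = resnet34
sz = 224                 # images are resized/cropped to 224x224 for the pretrained resnet

# Lesson 1 setup: transforms derived from the model, data read from the folder
# structure, a pretrained ConvLearner with precomputed activations, then a short fit.
tfms = tfms_from_model(arch, sz)
data = ImageClassifierData.from_paths(PATH, tfms=tfms)
learn = ConvLearner.pretrained(arch, data, precompute=True)
learn.fit(1e-2, 3)       # learning rate and epoch count are placeholders
```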


(Jeremy Howard) #4

Sounds kinda awesome to me. Not that I like In-N-Out burgers myself - can’t understand the attraction…

Anyhoo, if you’re getting a NaN loss something is very wrong. My guess is the learning rate is too large. If you show a screenshot of the training process we’ll be able to tell more.
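
If you want to sanity-check the learning rate first, the LR finder is the usual tool. A minimal sketch, assuming `learn` is the lesson 1 ConvLearner:

```python
# Trains briefly while increasing the learning rate, recording the loss at each step.
learn.lr_find()

# Plot loss vs. learning rate; pick a rate roughly an order of magnitude
# below the point where the loss starts to blow up.
learn.sched.plot()
```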


(WG) #5

[screenshot of the training output and lr_find plot]


(Jeremy Howard) #6

What arch and sz are you using? How big are the original images? Can you show a couple of examples? That lr_find plot looks really odd…


(WG) #7

arch = resnet34 (the only one I’ve tried so far)
sz = 224

Images are pretty big … I think the average was something around 1000x800

Positive examples:
[attached sample In-N-Out image]

Negative examples:
[attached sample images of other burgers]


(WG) #8

Could it be because I am not running on a GPU?

I’m just testing on my MacBook Pro … just using the CPU.


(Kevin Bird) #9

I don’t have any advice here, but this is an amazing project!


(Jeremy Howard) #10

The CPU code is new and I haven’t looked at it. Worth trying on a GPU. Also worth trying a very small learning rate. Need to figure out where that NaN is coming from…
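
A quick sketch of both checks, assuming the lesson 1 `learn` object is in scope (1e-4 below is just an example of “very small”):

```python
import torch

# Confirm whether PyTorch can actually see a GPU on this machine.
print(torch.cuda.is_available())

# Retry a short fit with a very small learning rate to see whether the NaN goes away.
learn.fit(1e-4, 1)
```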


(WG) #11

For us in Southern California, eating any other kind of burger is borderline blasphemous, so it could save lives, or even souls.


(Kevin Bird) #12

I don’t think they have any of those in Nebraska. They do look good though.


(Jeremy Howard) #13

Nah, they’re really not. And the fries are always drier than any fry deserves to be. Sorry @wgpubs, I can see you feel strongly about this, but I just can’t avoid blasphemy.


(WG) #14

I’m going to agree with you about the fries … but only the fries.

You’ve got me curious though: what are you folks eating in the way of burgers up in Northern California?


(Ravi Sekar Vijayakumar) #15

Awesome project. My vote is for In-N-Out as well, and I’m sure once your model works, it will pick it too!


(Jeremy Howard) #16

(WG) #17

Never heard of it.

I’ve got a kid applying to some schools up north, so when we do our road trip in a couple of months I’ll check it out.


(WG) #18

Running on the GPU and it’s working beautifully.

Getting 93.1% accuracy before TTA … 92.9% after TTA.
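
For anyone curious, the post-TTA accuracy can be computed with something along these lines (a sketch against the 0.7-era API; `accuracy_np` and `np` come in with the standard fastai imports, and the exact shape handling has varied a bit between versions):

```python
# Test-time augmentation: average predictions over several augmented
# versions of each validation image, then score against the labels.
log_preds, y = learn.TTA()
probs = np.mean(np.exp(log_preds), 0)   # average over the augmented copies
accuracy_np(probs, y)
```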

Now I just have to figure out how to build an Android app that can run this model … any advice on how to “productionize” our work would be appreciated.

… and thanks again for your help!


(sergii makarevych) #19

I still see .cuda() calls in the code without checking whether CUDA is available.
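
For reference, the guarded pattern would be something along these lines (a generic sketch, not the library’s actual code):

```python
import torch

USE_GPU = torch.cuda.is_available()

def to_gpu(x):
    # Move a tensor (or module) to the GPU only when one is actually present,
    # so the same code still runs on CPU-only machines.
    return x.cuda() if USE_GPU else x
```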


(Jeremy Howard) #20

A PR to fix the CPU training would be welcome :slight_smile: