Any Ideas on how to modify Resnet with Images that have 2 channels?

kcturgutlu · November 1, 2017, 12:00am

Hi,

After having the first course yesterday I want to replicate the notebook and try it on a different dataset to both challenge myself and have some fun.

So there is a Kaggle competition going on right on link here: https://www.kaggle.com/statoil

Most of the kernels I’ve come across are defining their own CNN classes which seems to be the right way to go with this competition since image data consists of 2 channels (1 - HH band image, 2 - HV band image) .

My question is this: Would there be a way to hack Resnet in order to allow to train it for this problem. These might sound stupid since I am fairly new to DL but some of the ideas I am came up with are:

Adding a 3rd channel which will be equal for each image. This way the kernels won’t be affected by the 3rd channel while they are being updated and the net might only focus on our original 2 channels.

img = (np.array(train['band_1'][0]) + np.array(train['band_2'][0])).reshape(75, 75) + np.zeros((75, 75))

Redefining the architecture to allow 2 channel images. This seemed really hard to me At least what I’ve understand from complex ResNet paper is that many layer’s depend on the residuals of the previous later. So only updating first convolutional layer to let’s say to (75, 75, 2) won’t do it.

Thanks !

ramesh · November 1, 2017, 1:00am

Hi @kcturgutlu,

It’s great to see that you are already looking at other datasets that you could apply what you learnt in Lesson 1. For this competition from statoil, you may not be able to use Resnet. The dataset dimensions are only 75x75x2 and Resnet requires a minimum size of 197 on width and height along it being a 3D (RGB) Tensor. So, I would suggest you -

Choose a different competition in Kaggle that you might be able to directly apply what you learnt even if it’s a kaggle competition that’s not currently running. Fisheries is a good one - https://www.kaggle.com/c/the-nature-conservancy-fisheries-monitoring
Build a CNN from scratch using fastai library. If you poke around the examples in the repo, I am sure you will find some sample code to get started.

Good luck and best wishes from your fellow learner.

ramesh · November 1, 2017, 1:12am

I would also recommend trying your had at Dog Breed Classification Kaggle Competition - https://www.kaggle.com/c/dog-breed-identification. It might be closest to the lesson 1 - cats vs dogs.

kcturgutlu · November 1, 2017, 1:32am

Thanks ramesh

thiago · November 1, 2017, 1:54am

How about convert the images to RGB and resize to 197px on width?

kcturgutlu · November 1, 2017, 3:03am

I followed the same steps in the notebook after creating dirs data / train valid test from images and it works with size 75

I basically summed up vertical and horizontal images as it is suppose to give composite color images.
Then saved them as the fastai library requires in jpg format using 70-30 % startified sampling.

jeremy · November 1, 2017, 4:04am

Cool! So… how did it go?

kcturgutlu · November 1, 2017, 4:05am

I need to lower the batch size I guess, since there is not enough iterations to see the learning rate go down and rise again. I tried sz but I guess it’s the image size. So far with no tuning ~0.73 on validation.

kcturgutlu · November 1, 2017, 4:09am

jeremy · November 1, 2017, 4:11am

You need the bs param when creating your data object.

kcturgutlu · November 1, 2017, 4:13am

Ok thanks!

kcturgutlu · November 1, 2017, 4:15am

What is the default for bs ?

jeremy · November 1, 2017, 4:16am

shift-tab will show you

kcturgutlu · November 1, 2017, 4:18am

Right I decreased my batch size but now it’s giving lower accuracy in validation set should I just experiment with different bs - lr combinations ? Thanks

jeremy · November 1, 2017, 4:20am

Yup pretty much. Although based on that LR chart you show above it look like a tiny dataset?

kcturgutlu · November 1, 2017, 4:22am

Yes, 1122 train 482 val (70-30) and 8500 test

jeremy · November 1, 2017, 4:24am

Oh yeah that is small. You’ll have to play around with hyperparams to see what works best… You may find resnet18 works better for such a small dataset, and may need more dropout (which we’ll learn about later).

kcturgutlu · November 1, 2017, 4:28am

Thanks so much for the tips, I will try my best and submit so far what I can do. Then will wait for the next classes!

jeremy · November 1, 2017, 4:31am

Let us know how it goes. Perhaps show a Kaggle kernel with your work once it’s going ok?

kcturgutlu · November 1, 2017, 4:48am

Yes sure, can we share jupyter notebooks here ? I guess Tim did it before right ?