Passthrough CNN that learns to predict its input

dusan · November 3, 2018, 1:12pm

I’m trying to build a simple passthrough CNN that just predicts the image it receives as input. I would assume that for a 3-channel image, 3 filters of size 1 with stride 1 and no padding should be able to do the job perfectly, but they fail to do so (using L1 loss):

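import torch.nn as nn

class PassthroughNet(nn.Module):

    def __init__(self, ch_in, filters):
        # `filters` is accepted but not used by this minimal version.
        super().__init__()
        # A single 1x1 convolution mapping ch_in channels back to ch_in channels.
        self.conv_final = nn.Conv2d(ch_in, ch_in, kernel_size=1, stride=1, padding=0)

    def forward(self, x):
        x = self.conv_final(x)
        return x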
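As a sanity check on the architecture itself, here is a minimal sketch showing that this layer can represent an exact passthrough: a 1x1 convolution mixes channels at each pixel, so an identity channel-mixing matrix with zero bias copies the input. The batch shape and the hand-set weights are assumptions for illustration, not part of the original notebook.

```python
import torch
import torch.nn as nn

ch_in = 3
conv = nn.Conv2d(ch_in, ch_in, kernel_size=1, stride=1, padding=0)

with torch.no_grad():
    # Weight shape is (out_channels, in_channels, 1, 1).
    # An identity matrix here means "output channel c = input channel c".
    conv.weight.copy_(torch.eye(ch_in).view(ch_in, ch_in, 1, 1))
    conv.bias.zero_()

x = torch.rand(2, ch_in, 8, 8)  # hypothetical batch of random "images"
with torch.no_grad():
    y = conv(x)

max_abs_error = (y - x).abs().max().item()
print(f"max |output - input| = {max_abs_error:.2e}")
```

If the error is at floating-point level, the identity solution exists for this layer, and the open question becomes whether training actually reaches it.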

Sample predictions:

Should this work?

Source notebook:

BBloggsbott · December 12, 2018, 10:45am

that just predicts the image it receives on input.

Like an autoencoder?

bluesky314 · December 12, 2018, 5:19pm

Did you end up fixing this? Interested to know what happened…

BBloggsbott · December 13, 2018, 4:43am

No. Not yet.

I just don’t understand its purpose. Why do we need a model that is trained to return what it gets?

bluesky314 · December 13, 2018, 10:20am

I was asking @dusan. But this is a toy experiment to understand convolutions better. You can see how long a CNN takes to learn a simple identity mapping with 1x1 filters, whether it ever reaches it exactly, or what kind of optimization it needs to get there. The purpose is experimentation.
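One way to run that experiment is sketched below. It is only an illustration under stated assumptions: random tensors stand in for real images, and the optimizer (Adam), learning rate, batch shape, and step count are all guesses, since the original training setup is not shown in this thread.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)  # make the run repeatable

class PassthroughNet(nn.Module):
    def __init__(self, ch_in):
        super().__init__()
        self.conv_final = nn.Conv2d(ch_in, ch_in, kernel_size=1, stride=1, padding=0)

    def forward(self, x):
        return self.conv_final(x)

model = PassthroughNet(ch_in=3)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-2)  # assumed optimizer
loss_fn = nn.L1Loss()  # the loss mentioned in the first post

losses = []
for step in range(300):
    x = torch.rand(16, 3, 16, 16)  # random stand-in for a batch of images
    optimizer.zero_grad()
    loss = loss_fn(model(x), x)    # the target is the input itself
    loss.backward()
    optimizer.step()
    losses.append(loss.item())

initial_loss, final_loss = losses[0], losses[-1]
print(f"L1 loss: {initial_loss:.4f} -> {final_loss:.4f}")
```

Plotting `losses`, or repeating the run with a different optimizer or learning rate, would show how quickly the layer approaches the identity and whether it plateaus before getting there.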

BBloggsbott · December 13, 2018, 11:33am

To understand convolutions, you can check out my code

https://www.kaggle.com/bbloggsbott/understanding-convolutional-neural-network

Or this paper might help too

I’ll try to modify @dusan's code and let you know if I make any progress.

dusan · December 23, 2018, 11:25pm

Sorry for the late answer, guys. I did not tinker with it much more, so I am not sure what’s going on. Either some part of the input information is irretrievably lost, or the net goes in the wrong direction during learning and gets stuck. Those are just my hypotheses; it could be something else, and more experiments are needed. Dunno.
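A possible way to tell those two hypotheses apart, assuming the trained model is the single 1x1 convolution from the first post: read the layer's 3x3 channel-mixing matrix directly. A determinant near zero would mean the channels are being collapsed (information lost), while a well-conditioned matrix that simply differs from the identity would point to optimization instead. The helper name `inspect_conv1x1` and the hand-built "collapsed" layer below are hypothetical, used only to show what a lossy case looks like.

```python
import torch
import torch.nn as nn

def inspect_conv1x1(conv):
    """Summarize how far a square 1x1 conv is from the identity mapping."""
    out_ch, in_ch = conv.weight.shape[:2]
    w = conv.weight.detach().reshape(out_ch, in_ch)  # channel-mixing matrix
    return {
        # Largest deviation of any weight from the identity matrix.
        "weight_dev": (w - torch.eye(in_ch)).abs().max().item(),
        # Largest bias magnitude (the identity mapping needs zero bias).
        "bias_max": conv.bias.detach().abs().max().item(),
        # Determinant near 0 means the input cannot be recovered from the output.
        "det": torch.linalg.det(w).item(),
    }

# Hypothetical stand-in for a trained layer: every output channel is the
# mean of the input channels, so the colour information is collapsed.
collapsed = nn.Conv2d(3, 3, kernel_size=1)
with torch.no_grad():
    collapsed.weight.fill_(1.0 / 3.0)
    collapsed.bias.zero_()

report = inspect_conv1x1(collapsed)
print(report)
```

Calling `inspect_conv1x1(model.conv_final)` on the trained network would give the same three numbers for the real layer.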