Visualizing intermediate layers a la Zeiler and Fergus

dhoa · November 3, 2018, 8:00pm

very happy to know that my code is useful.

The source code of flatten_model is : flatten_model = lambda m: sum(map(flatten_model,m.children()),[]) if num_children(m) else [m]

And I found it by looking at the hooks.py in /fastai class HookCallback(LearnerCallback):

def on_train_begin(self, **kwargs):
    if not self.modules:
        self.modules = [m for m in flatten_model(self.learn.model)
                        if hasattr(m, 'weight')]
    self.hooks = Hooks(self.modules, self.hook)

It indicates that if modules are not specified then choose all the modules which have weight in the model. So for yours interest, you just need to choose the last linear layer. I got familiar with these things by playing around with the ActivationStats and model_sizes in hooks.py

I think for getting just the activations, you don’t need callback. The hook_outputs is enough. Actually, I haven’t played with the callback yet still not familiar yet with the the method in callback.

I’m very appreciated if someone can give more examples of this topic too

jeremy · November 3, 2018, 8:30pm

Using the callback should make your code a little simpler. Amongst other things, it can automatically remove your hook for you when done.

dhoa · November 3, 2018, 8:37pm

Thanks Jeremy. Can i add the callback after the model have already trained ? Because in this case I train the model first then try to get the activation in the validation set

jeremy · November 3, 2018, 8:57pm

Sure - there are params for callbacks in get_preds and validate, amongst others.

henripal · November 9, 2018, 5:44am

Hey, if useful I implemented Grad-CAM using fastai 1.0; allows to see which parts of the image are “used” by the network to make a prediction. It’s a variation on CAM using averaging with gradients.

This might also be interesting to some of you because it has gradient/backward hooks as well as the output_hooks.

notebook

Example result:

cwerner · November 10, 2018, 10:27pm

Very cool. Any chance we can tackle the improved grad-cam++

https://arxiv.org/abs/1710.11063

They have a TensorFlow implementation on github…

henripal · November 11, 2018, 12:48am

Yes ! I’d love to try that out (also the guided variants, because Grad-Cam’s resolution is awful for the application I tried, sat imagery).

cwerner · November 11, 2018, 1:11am

Yeah, I also wondered about the poor resolution? Are there other/ better methods for this? Or is grad-cam (++) still state of the art?

MicPie · November 11, 2018, 10:51am

Thank you for your great and instructive notebook!
I incorporated it into my lung tissue pet-project.

While running the code several times I sometimes encountered a strange bug that I was able to fix by setting the model in evaluation mode with learn.model.eval().

henripal · November 11, 2018, 2:42pm

Poor resolution is inevitable - the output of the “bottleneck” units in resnet are 7x7 feature mapsz so all the CAM methods are going to be constrained by this.

I’m not really sure about state of the art (and it’s probably hard to define?) - especially for satellites, I did the PCA/tsne on the last later vectors and it’s apparent that the network is distinguishing “twisty roads” from “grid cities” and “big blocks” vs “lots of little houses”. There’s also a huge color component; beiges and ocres from Mediterranean citiess, …

So I’m thinking to try out a bunch of things and see what sticks

henripal · November 11, 2018, 2:43pm

Thank you so much!

I had the same issue and just chalked it up to the algorithm failing sometimes !!

henripal · November 12, 2018, 8:14pm

Hi - added Guided Backprop and Grad-CAM guided backprop to the notebook. Literally three lines of code using fastai Hooks . . .

The idea is to determine the importance of each pixel to the prediction that is made by the network. To avoid interference, the backprop is clipped to positive gradient contributions. This was introduced in Striving for Simplicity: the All Convolutional Net paper.

Example output:

Element-wise multiplication with grad-cam:

And here is the notebook, also showing what happens when you just plot the gradients, without doing guided backprop.

cwerner · November 14, 2018, 8:18am

very cool. I was just about to try that in my notebook, too

balnazzar · December 20, 2018, 7:00pm

I’m not catching the exact role of global average pooling in CAM. If you read the paper, one of the major points the authors of CAM insist upon is the ability of Global Average Pooled CNNs to extract features and indentify objects even if they are not specifically trained to do so.

By GAP-CNNs they mean a CNNs which is bestowed with a GAP layer just between the last convolutional and the final fully connected classifier.

The authors summarize the general setting:

Then, they go on and define how the CAM itself can be obtained:

The equation to look upon is (2), since it defines every spatial element of the CAM. In this way they get a tensor which has the same dimensions of the last feature map outputted by the last conv layer. Then, typically one upsamples such map and visualize it in overlay with the original image Like @henripal did, to ‘see what the cnn considers important for classifying such image’.

My question is:

The authors insist a lot about the importance of Global Average Pooling, but in the end, the define their CAMs in a way that does not consider the GAP layer at all. Written as such, it’s like the Dense layer is directly connected with the last conv. On the other hand, if plug in the GAP, you just obtain a vector whose dimesion corresponds to the number of classes, quite useless for visualization purposes.

Can someone shed a bit of light about that? Thanks!

shivamchandhok · April 6, 2019, 12:30pm

The link to the notebook is not working.
Can you share it again

maelh · May 4, 2019, 3:16pm

@henripal @dhoa @MicPie
As mentioned in another thread, there is a nice general library that visualizes CNNs using many approaches, which allows for easy comparision (Keras/Tensorflow for now):

Since a couple of you seem to have had success hooking into PyTorch, maybe you could help make innvestigate work with PyTorch/fastai?

Sayak · May 16, 2019, 4:07pm

This worked like a charm. Thank you so much. The code is also readable. Really cannot thank you enough.

muhajir · June 11, 2019, 12:49am

Would you share it again. The link is broken.
Thanks

muellerzr · June 11, 2019, 12:57am

@muhajir

github.com

henripal/maps/blob/master/nbs/big_resnet50-interpret-gradcam-dogs.ipynb

{
 "cells": [
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# Grad CAM exploration\n",
    "\n",
    "We are modifying the lesson 1 notebook to see if we can implement [Grad-CAM](https://arxiv.org/abs/1610.02391). The idea is to obtain a heatmap of the important areas of the images for the classifier.\n",
    "\n",
    "The objective is to eventually use this for the satellite project."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 1,
   "metadata": {},
   "outputs": [],
   "source": [
    "%matplotlib inline"

This file has been truncated. show original

muhajir · June 11, 2019, 11:17am

Thank you so much!