Mikonapoli. Thanks for doing the work.
Hi, are there any docs about TextClassificationInterpretation? I can't find it in https://docs.fast.ai/. Thanks a lot!
No, this is a new experimental feature developed by @herrmann
May I add this to the docs in the text.learner section, mirroring vision.learner?
A small example might be helpful there.
By all means! Any PR to make the docs better is more than welcome.
PR is out there!
Hi,
I've written a method called show_top_losses() to enhance TextClassificationInterpretation, inspired by plot_top_losses in vision.learner.
It creates a table showing the first k texts in top_losses, along with their prediction, actual class, loss, and the probability of the actual class, like this (on my own dataset):
My code:
from tabulate import tabulate  # table rendering used below

def show_top_losses(self, k:int)->None:
    "Print the `k` texts with the highest losses, plus prediction, actual class, loss and probability."
    table_header = ['Text', 'Prediction', 'Actual', 'Loss', 'Probability']
    table_data = []
    tl_val, tl_idx = self.top_losses()
    for i, idx in enumerate(tl_idx):
        tx, cl = self.data.dl(self.ds_type).dataset[idx]
        cl = cl.data
        classes = self.data.classes
        tmp = (self.cut_by_line(tx.text),
               f'{classes[self.pred_class[idx]]}',
               f'{classes[cl]}',
               f'{self.losses[idx]:.2f}',
               f'{self.probs[idx][cl]:.2f}')
        table_data.append(tmp)
        k -= 1
        if k == 0: break
    print(tabulate(table_data, headers=table_header, tablefmt='orgtbl'))

def cut_by_line(self, text):
    "Hard-wrap `text` every 80 characters so the table columns stay readable."
    res = ""
    width = 80
    lines = len(text) // width
    if lines == 0:
        res += text
    else:
        for i in range(lines):
            res += text[i * width:(i + 1) * width] + '\n'
        res += text[lines * width:]  # remainder after the last full-width line
    return res
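For anyone who wants to try it, here is a minimal usage sketch (the `learn` variable is a hypothetical already-trained fastai v1 text classifier; from_learner is inherited from ClassificationInterpretation, so check it against your installed version):

from fastai.text import *

# `learn` is assumed to be an already-trained text_classifier_learner
interp = TextClassificationInterpretation.from_learner(learn)
interp.show_top_losses(10)  # table of the 10 highest-loss texts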
I found it useful. May I add this to awd_lstm.py? @sgugger
That looks useful, don’t hesitate to suggest a PR with it!
Thanks! The PR is here
Thank you. I will definitely use this!
From the Sequential Jacobian section of https://www.cs.toronto.edu/~graves/preprint.pdf:

"However it should be stressed that sensitivity does not correspond directly to contextual importance. For example, the sensitivity may be very large towards an input that never changes, such as a corner pixel in a set of images with a fixed colour background, or the first timestep in a set of audio sequences that always begin in silence."
I believe this explains why xxbos
has high sensitivity. Maybe you can ignore it in gradient normalizations to better see the relative sensitivity of the actual tokens.
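To make that concrete, here is a minimal sketch of the idea (all names are hypothetical; it assumes you already have per-token sensitivity scores, e.g. input-gradient magnitudes):

import numpy as np

def normalized_sensitivity(tokens, grads, ignore=('xxbos',)):
    # Raw sensitivity: magnitude of the gradient w.r.t. each token
    sens = np.abs(np.asarray(grads, dtype=float))
    # Exclude special tokens so they don't set the normalization scale
    mask = np.array([t not in ignore for t in tokens])
    denom = sens[mask].max() if mask.any() else 1.0
    # Ignored tokens get zero; the rest are scaled by the max over real tokens
    return np.where(mask, sens / max(denom, 1e-9), 0.0)

Extending the ignore set with xxmaj and xxup would let you check how much those tokens dominate as well.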
Possibly, but I guess I would expect xxbos to always be highlighted since it starts every sequence. When running on my target data set with longer sequences the special character xxbos was rarely highlighted. But other special characters like xxmaj and xxup were highlighted fairly often. Overall this technique produced some really interesting results.
Can anyone tell me where to import 'TextClassificationInterpretation' from?
It should be under fastai.text.interpret. You can also use an IDE like VS Code to search for class definitions; I do that whenever I want to look something up.
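For reference, once you are on a build that ships the module, the import would look like this (module path taken from the reply above; verify it against your installed version):

# Only available on fastai v1 builds that include the interpret module
from fastai.text.interpret import TextClassificationInterpretation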
Thanks for your advice.
I had searched for it earlier (before writing here) as you suggested, and in some other modules too, but it is neither in the module you suggested nor in the others I searched.
I could find 'ClassificationInterpretation' but not 'TextClassificationInterpretation'.
Also, the associated attributes are not found after importing 'ClassificationInterpretation'.
You're missing the interpret module altogether. Try the latest dev install.
Here is the source code:
Thanks.
Sorry for bothering you, but I still have a question.
Actually, I am running a standalone notebook without a developer install.
I am just running pip install -U fastai to update the fastai library, but it does not recognize fastai.text.interpret. Do you know if only a dev install can solve this problem?
Yes, a dev install will get you master, i.e. the latest development code, which already includes the interpret module.