Label smoothing via cnn_learner

Hi.

Is there a way to add loss_func = LabelSmoothingCrossEntropy to a cnn_learner()? This is my code:

learn = cnn_learner(data, models.resnet50, metrics=[accuracy, top_1], callback_fns=ShowGraph)

Specify it as loss_func= when you create the learner.

You also need to pass it in as an instance, i.e. LabelSmoothingCrossEntropy()

Right but how do I add it to my cnn_learner()?

Just pass it in with your call to cnn_learner as a parameter.

I did this:

learn = cnn_learner(data, models.resnet50, metrics=[accuracy, top_1], loss_func = LabelSmoothingCrossEntropy, callback_fns=ShowGraph)

And when I ran

learn.lr_find()
learn.recorder.plot(suggestion=True)

I got

/usr/local/lib/python3.6/dist-packages/torch/nn/modules/module.py in __getattr__(self, name)
    574                 return modules[name]
    575         raise AttributeError("'{}' object has no attribute '{}'".format(
--> 576             type(self).__name__, name))
    577 
    578     def __setattr__(self, name, value):

AttributeError: 'LabelSmoothingCrossEntropy' object has no attribute 'detach'

See what I said in my first comment: you passed the class itself, but it needs to be an instance.
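
For reference, here’s a minimal sketch of the corrected call (assuming the same data, metrics, and imports from your snippet):

# Note the parentheses: pass an *instance* of the loss, not the class.
# Passing the bare class means each "loss" computed is actually a new
# LabelSmoothingCrossEntropy object, which has no .detach() method --
# hence the AttributeError above.
learn = cnn_learner(data, models.resnet50,
                    metrics=[accuracy, top_1],
                    loss_func=LabelSmoothingCrossEntropy(),
                    callback_fns=ShowGraph)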

@muellerzr following up on this: say you have two datasets to train on, but you would like to weight the training samples of one more heavily than the other’s. How would you go about that?

Say, for instance, the images are in two different dataframes, so you concatenate them and use ImageItemList.from_df.

@oo92 @DrC I did some digging in my old repos; check out this notebook here. It has the LabelSmoothing code and examples:

Note: even though it may not use cnn_learner, you can always pass in a loss_func
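
For example, with any Learner (even one not built with cnn_learner), you should be able to swap the loss in after the fact, something like:

learn.loss_func = LabelSmoothingCrossEntropy()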

That’s a tricky question! One potential approach would be to ensemble two models with said data, and then weight one model’s predictions over the other’s by some fixed amount, e.g. preds = 0.4 * preds_a + 0.6 * preds_b.
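
A rough sketch of the idea (learn_a and learn_b are hypothetical names for the two trained learners, evaluated on the same validation set):

preds_a, targs = learn_a.get_preds()  # validation predictions from model A
preds_b, _ = learn_b.get_preds()      # validation predictions from model B

# weight model B's predictions 60/40 over model A's
preds = 0.4 * preds_a + 0.6 * preds_b
accuracy(preds, targs)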

Otherwise, unless it’s between classes (which it sounds like it’s not), I haven’t looked into that particular area before! :slight_smile:

Well, they are not separate, since a single batch can contain samples from both dataset 1 and dataset 2, but the two have different weights associated with them. It’s kind of like upweighting the classes of an imbalanced dataset, except the weighting is between datasets rather than between classes. I put it under this thread because I felt this could be handled with some different loss function in fastai/PyTorch!
Any ideas would be great :slight_smile:
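
Something like a per-sample weighted cross entropy is roughly what I’m imagining, e.g. this rough, untested sketch in plain PyTorch (getting the per-sample weights into each batch is the part I’m unsure about):

import torch.nn.functional as F

def weighted_ce(preds, targets, sample_weights):
    # reduction='none' keeps one loss value per sample, so each can be
    # scaled by the weight of the dataset it came from (e.g. 1.0 vs 2.0)
    losses = F.cross_entropy(preds, targets, reduction='none')
    return (losses * sample_weights).mean()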