Training a resnet model, then using it for transfer learning as if it were a built-in model with pretrained weights

I found a forum post with a very similar problem to mine:

Technically, someone does give a solution there and it does appear to work; however, I suspect it may not be the recommended solution for this particular problem (at least I hope not).

In my specific scenario I am training a resnet model with cnn_learner, starting from the pretrained weights, but my dataset is ultrasound images. Even though the pretrained model is a good starting point, as you can imagine my final model takes a significant amount of training time to reach its best accuracy. So what I would like to do is save that model somewhere and then pass it to cnn_learner as if it were a pretrained resnet model. Is there any way to save this model and use it like any other built-in pretrained model?

So I figured this out for my issue.

import torch
import torch.nn as nn

def my_resnet(pretrained):
    # cnn_learner calls this with a `pretrained` flag; we ignore it and
    # always load the previously trained model from disk.
    model = torch.load("my_trained_pytorch_resnet_model")
    all_layers = list(model.children())
    # Flatten the first child's layers and keep the rest, so the result
    # looks like a plain torchvision-style backbone.
    return nn.Sequential(*all_layers[0], *all_layers[1:])

learn = cnn_learner(data,
                    my_resnet,
                    cut=-2,                              # drop the old head
                    split_on=lambda m: (m[0][6], m[1]))  # body/head split for discriminative LRs
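For anyone following along, the file loaded by my_resnet above would be produced beforehand with torch.save; a minimal sketch (using a tiny stand-in module rather than the actual trained learn.model):

```python
import torch
import torch.nn as nn

# Stand-in for learn.model, the resnet trained on the ultrasound data.
model = nn.Sequential(nn.Conv2d(3, 8, 3), nn.ReLU(), nn.Flatten())

# Save the entire module object (not just the state dict) so that
# torch.load() in my_resnet() can restore it without first
# rebuilding the architecture.
torch.save(model, "my_trained_pytorch_resnet_model")
```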

@rbunn80130 I’m trying to do something similar to this.

Did you get this to work or have a notebook somewhere? I’m super curious about how your resolution worked.

Other than what I posted above, not really.

@muellerzr - do you have any good approaches to this problem?

I do! This notebook about 3/4 of the way down:

It discusses using custom pkl weights for your model. The function itself is called transfer_learn.
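I haven't reproduced the notebook's code here, but the core of a transfer_learn-style function is copying every parameter whose name and shape match from the saved state dict into the new model; a minimal sketch under that assumption (transfer_weights is my name, not the notebook's):

```python
import torch.nn as nn

def transfer_weights(model: nn.Module, state_dict: dict) -> None:
    # Keep only the saved parameters that exist in the new model with
    # the same shape; everything else (e.g. a resized head) stays at
    # its fresh initialization.
    own = model.state_dict()
    matching = {k: v for k, v in state_dict.items()
                if k in own and own[k].shape == v.shape}
    model.load_state_dict(matching, strict=False)
```

With strict=False, load_state_dict silently skips the keys we filtered out instead of raising an error about missing parameters.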


Thanks! That’s exactly what I was looking for. You sir, are a genius.

It appears that if some linear layers in the head happen to have the same shape as before, then those values are copied over as well. How would you go about resetting the head even when it happens to have the same shape?

For now, I just enumerated the items and stopped copying when the head is reached, but that obviously isn't a good general solution.
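A less manual version of "stop copying when the head is reached" is to filter the saved state dict by key prefix. In a fastai cnn_learner model the body is child 0 and the head is child 1, so head keys start with "1." (that layout is the assumption here; copy_body_only is an illustrative name):

```python
import torch.nn as nn

def copy_body_only(model: nn.Module, state_dict: dict,
                   head_prefix: str = "1.") -> None:
    # Drop every parameter belonging to the head so it keeps its fresh
    # random init even when its shapes happen to match the saved ones.
    body_only = {k: v for k, v in state_dict.items()
                 if not k.startswith(head_prefix)}
    model.load_state_dict(body_only, strict=False)
```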

You could look into init_cnn and see how fastai initializes its models. Basically you would need to reinitialize that last layer.
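Concretely, fastai's init_cnn kaiming-initializes conv/linear weights, and applying the same idea to just the head resets it regardless of shape. A sketch in that spirit (reinit_head is illustrative, not a fastai function; the zeroed biases are my choice, not necessarily fastai's exact behavior):

```python
import torch.nn as nn

def reinit_head(head: nn.Module) -> None:
    # Re-randomize every conv/linear layer in the head, mirroring the
    # kind of initialization fastai's init_cnn applies to a fresh model.
    for m in head.modules():
        if isinstance(m, (nn.Conv2d, nn.Linear)):
            nn.init.kaiming_normal_(m.weight)
            if m.bias is not None:
                nn.init.zeros_(m.bias)
```

For a cnn_learner model that would be something like reinit_head(learn.model[1]).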
