Implementing WideResNet from scratch

amqdn · March 7, 2019, 8:26pm

Hey, all!

Today I’ve got an implementation of WideResNet for you. Trying to figure out what these papers are saying and how to implement them in fast.ai is really teaching me a lot. Some takeaways:

Darknet really is fast – really
I did not test this myself, but extrapolating from WRN’s performance in this notebook, I can see how it would outperform ResNets trained from scratch with a similar number of parameters. Based on this and other results from the paper, I will seriously consider widening a network before deepening it if I ever feel the need to go beyond 50 layers
As you can see in the notebook, train_loss consistently hovers above valid_loss in later epochs while valid_loss continues to drop. My suspicion is that the dropout layers are really helping the network generalize; this bears further investigation
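
To put rough numbers on the widen-versus-deepen takeaway, here is a minimal sketch that counts weights for a WRN-d-k layout as described in the paper: a 16-channel stem, then three groups of widths 16k, 32k and 64k, each with (d - 4) / 6 basic blocks of two 3x3 convs. The function name `wrn_param_count` and its defaults (1 input channel and 10 classes, as for MNIST) are my own assumptions for illustration. It ignores batch norm parameters and conv biases, and it may not match the layer layout in the notebook exactly.

```python
def wrn_param_count(depth, widen, in_channels=1, num_classes=10):
    """Rough weight count for a WRN-depth-widen net (convs + final linear only)."""
    if (depth - 4) % 6 != 0:
        raise ValueError("depth must be of the form 6n + 4")
    blocks_per_group = (depth - 4) // 6
    widths = [16, 16 * widen, 32 * widen, 64 * widen]

    total = 3 * 3 * in_channels * widths[0]  # 3x3 stem convolution
    c_in = widths[0]
    for c_out in widths[1:]:
        for _ in range(blocks_per_group):
            total += 3 * 3 * c_in * c_out   # first 3x3 conv of the block
            total += 3 * 3 * c_out * c_out  # second 3x3 conv of the block
            if c_in != c_out:
                total += c_in * c_out       # 1x1 conv on the shortcut (assumed)
            c_in = c_out
    total += c_in * num_classes + num_classes  # final linear layer with bias
    return total


# Compare a few shallow-and-wide configs against a deeper, thin one.
for depth, widen in [(16, 1), (16, 2), (16, 8), (40, 1)]:
    print(f"WRN-{depth}-{widen}: {wrn_param_count(depth, widen):,} weights")
```

In this count, doubling the widening factor roughly quadruples the conv weights, while adding depth grows the total only linearly, so a shallow wide network can reach the parameter budget of a much deeper thin one with far fewer sequential layers.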

Enjoy!

https://gist.github.com/amqdn/211b84d93bf05becbba89ecbca2ba20c

fastai-wideresnet-mnist.ipynb
