In the Dynamic UNET, why are we creating the middle conv and unet block in eval mode?

jimmiemunyi · March 30, 2022, 8:06am

Here is the code I am referring to. Inside the __init__ method of the DynamicUnet, we have these two sections:

middle_conv = nn.Sequential(ConvLayer(ni, ni*2, act_cls=act_cls, norm_type=norm_type, **kwargs),
                            ConvLayer(ni*2, ni, act_cls=act_cls, norm_type=norm_type, **kwargs)).eval()

and

unet_block = UnetBlock(up_in_c, x_in_c, self.sfs[i], final_div=not_final, blur=do_blur, self_attention=sa,
                       act_cls=act_cls, init=init, norm_type=norm_type, **kwargs).eval()

So I was digging through the code and was confused by this. Why are we calling the .eval() method after creating them?
Don’t we want them in training mode?

matdmiller · April 1, 2022, 6:45am

I believe this is because dummy data is passed through the model when it is initialized, in order to determine the input/output shapes of the layers. Calling .eval() prevents the batch norm running statistics from being updated during this process.

https://pytorch.org/docs/stable/notes/autograd.html#evaluation-mode-nn-module-eval
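
The effect described above can be checked on a single batch norm layer. This is a minimal sketch in plain PyTorch, not the fastai code itself: the layer size, the dummy tensor shape, and the variable names are all made up for illustration. It also shows that torch.no_grad() alone does not stop the running statistics from changing; only eval mode does.

```python
import torch
from torch import nn

torch.manual_seed(0)

# Hypothetical stand-in for a layer inside the U-Net; sizes are arbitrary.
bn = nn.BatchNorm2d(3)

# Dummy batch whose per-channel mean is far from the initial running mean (0).
dummy = torch.randn(4, 3, 8, 8) + 5.0

# Forward pass in eval mode: the running statistics are used but not updated.
bn.eval()
with torch.no_grad():
    bn(dummy)
stats_after_eval = bn.running_mean.clone()

# Same forward pass in training mode: the running statistics move toward
# the batch statistics, even though gradients are disabled.
bn.train()
with torch.no_grad():
    bn(dummy)
stats_after_train = bn.running_mean.clone()

print("after eval-mode pass: ", stats_after_eval)
print("after train-mode pass:", stats_after_train)
```

After the eval-mode pass the running mean is still at its initial value of zero, while the train-mode pass shifts it toward the mean of the dummy data. That shift is what a shape-probing pass at construction time would otherwise leave behind.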

github.com

fastai/fastai/blob/baed5074dacf0c9a82f93c06afba75ec3b45b945/nbs/15a_vision.models.unet.ipynb

jimmiemunyi · April 5, 2022, 7:30pm

Hello
That seems like a plausible reason. However, I can’t see the code that switches it back to training mode. Or does this happen automatically during the training loop?

I am assuming that training in eval mode would keep the batchnorm running statistics from being updated, which is undesirable behavior.

matdmiller · April 5, 2022, 9:18pm

I believe the switch happens automatically during the training loop. Eval mode has to be turned on every epoch when the validation set is evaluated, so the model must also be put back into training mode before each training pass.
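
A short sketch of why no explicit switch is needed on the individual blocks: in PyTorch, calling .train() or .eval() on a parent module sets the mode on every submodule recursively. The model below is a hypothetical stand-in, not the DynamicUnet. As far as I know, fastai's training loop makes these calls through its TrainEvalCallback before each training and validation phase, but that part is stated here as an assumption and is not shown in the code.

```python
from torch import nn

# A sub-block created in eval mode, like middle_conv or unet_block above.
block = nn.Sequential(nn.Conv2d(3, 8, 3), nn.BatchNorm2d(8)).eval()

# Hypothetical parent model containing that block. Wrapping the block does
# not change its mode: the parent starts in training mode, the block stays
# in eval mode.
model = nn.Sequential(nn.Conv2d(3, 3, 1), block)
modes_before = [m.training for m in model.modules()]

# What a training loop does before each training pass.
model.train()
modes_after_train = [m.training for m in model.modules()]

# What a training loop does before each validation pass.
model.eval()
modes_after_eval = [m.training for m in model.modules()]

print(modes_before)
print(modes_after_train)
print(modes_after_eval)
```

Once model.train() is called on the top-level model, the block that was built with .eval() is in training mode like everything else, so its batch norm statistics are updated during training as usual.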

jimmiemunyi · April 6, 2022, 9:20am

Awesome. Thanks for the help!