WeightDropout

rohit_gr · December 24, 2018, 3:44pm

Why are we using the raw weights saved during the beginning of the training but not the updated weights after we forward pass through every batch?

github.com

fastai/fastai/blob/f7d97d19487119c18f4ec788883890a8ca321602/fastai/text/models.py#L39


    for layer in self.layer_names:
        #Makes a copy of the weights of the selected layers.
        w = getattr(self.module, layer)
        self.register_parameter(f'{layer}_raw', nn.Parameter(w.data))
        self.module._parameters[layer] = F.dropout(w, p=self.weight_p, training=False)


def _setweights(self):
    "Apply dropout to the raw weights."
    for layer in self.layer_names:
        raw_w = getattr(self, f'{layer}_raw')
        self.module._parameters[layer] = F.dropout(raw_w, p=self.weight_p, training=self.training)


def forward(self, *args:ArgStar):
    self._setweights()
    with warnings.catch_warnings():
        #To avoid the warning that comes because the weights aren't flattened.
        warnings.simplefilter("ignore")
        return self.module.forward(*args)


def reset(self):
    for layer in self.layer_names:

sgugger · December 26, 2018, 10:36am

With the line 32, the raw weights are registered as a parameter of the network. They are thus updated during the optimizer step.

rohit_gr · December 26, 2018, 11:35am

Ok, thanks.

Perkins · March 23, 2020, 1:28pm

Weight dropout randomly drops individual weights in the weight matrices at each training step. Intuitively, this is dropping connections between layers, forcing the network to adapt to a different connectivity at each training step.