Is it possible to specify the amount of data augmentation

aloksaan · July 31, 2018, 5:36pm

How much data is augmented in the function tfms_from_model? If yes, then is it possible to override it?

KarlH · July 31, 2018, 6:10pm

There are some standard transformation functions in transforms.py. You can choose your own parameters for these functions and pass them to the tfms_from_model function.

augs = [
    RandomFlip(),
    RandomLighting(0.2, 0.2),
    RandomRotate(20)
    ]

tfms = tfms_from_model(arch, sz, aug_tfms = augs)

I haven’t tried it but I imagine you could write your own augmentation functions and pass them in if you wanted, as long as it works with OpenCV images.

aloksaan · August 1, 2018, 3:52pm

Thanks Karl,
but do you know what percentage of images are augmented. I couldn’t find any parameters for it

KarlH · August 1, 2018, 5:49pm

Each augmentation function has a probability parameter/parameters that can be passed in. Random rotate defaults to 0.75, random flip defaults to 0.5. You can look at the code in transforms.py. For each image in your dataset, each augmentation function has some probability of being applied.

github.com

fastai/fastai/blob/master/fastai/transforms.py

from .imports import *
from .layer_optimizer import *
from enum import IntEnum

def scale_min(im, targ, interpolation=cv2.INTER_AREA):
    """ Scale the image so that the smallest axis is of size targ.

    Arguments:
        im (array): image
        targ (int): target size
    """
    r,c,*_ = im.shape
    ratio = targ/min(r,c)
    sz = (scale_to(c, ratio, targ), scale_to(r, ratio, targ))
    return cv2.resize(im, sz, interpolation=interpolation)

def zoom_cv(x,z):
    """ Zoom the center of image x by a factor of z+1 while retaining the original image size and proportion. """
    if z==0: return x
    r,c,*_ = x.shape

This file has been truncated. show original

aloksaan · August 2, 2018, 4:38am

Thanks Karl, really appreciate your quick responses.
My question is little different. let us say i have 100 training images and if switch ON augmentation how many augmented images will be created (I know they are not real files). Also are there any guidelines on "how much " augmentation?

digitalspecialists · August 2, 2018, 8:51am

My understanding is that each time a training image is used it is randomly transformed. If you have flip, lighting, and rotation transforms, only one is randomly chosen per image per epoch, and is performed with a random magnitude within configured thresholds. Take a look at the source code if you want to be sure, or to change behaviour.

KarlH · August 2, 2018, 11:10pm

I don’t think that sort of question applies here. The augmentation functions are applied every time an image is passed from the dataloader to the model during training. So in a single epoch the model trains on one augmented version of each image in the data set.