How does get_transforms work?

diskandar · September 27, 2019, 4:26pm

Does anybody know how this method really works? It does not seem to add more data to the existing dataset. So does it just convert some portion of the existing data? How big a portion? Thanks.

muellerzr · September 27, 2019, 4:38pm

Have you read the documentation? The transforms are applied over the entire dataset.

https://docs.fast.ai/vision.transform.html

diskandar · September 27, 2019, 4:52pm

I read through it. It mentions ‘we do small random transformations’.
What does ‘small’ mean? What percentage is being transformed? Those are my questions … Why not ADD to the original dataset? From what I see, the size of the dataset remains the same.

Say you have 1000 images. After the transformation there are still 1000. Say 10% is transformed; then 100 are ‘new’ transformed images, which means we ‘lose’ the 100 originals. I don’t think this is good practice. It would be better to ADD the transformed images on top of the original dataset, so that you don’t lose any valuable original information.

farid · September 27, 2019, 7:49pm

The transformations are applied on the fly when the DataLoader builds the batch. Most of the transformations have a probability of occurrence, which means only a percentage of the images is transformed (let’s say 10% of the 1000 images in the dataset, as you mentioned in your message), and the rest (90%) stay unchanged.

As you mentioned above, we are basically throwing away 10% of our original images when we train our model for 1 epoch.

But remember that we train our model for a certain number of epochs (each epoch corresponds to one pass over the whole dataset). So, let’s say we train for 20 epochs: the model is then shown 20,000 images (20 x 1000 images), of which about 2000 are transformed (20 x 100 images). Because the transformed subset is drawn at random in every epoch, there is a good chance that our model has seen almost all the original images plus the extra 2000 transformed ones.
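
The arithmetic can be sanity-checked with a small simulation. This is a sketch in plain Python, not fastai code: the 10% probability and the 20 epochs are just the illustrative numbers used in this thread, and `simulate` is a made-up helper name.

```python
import random

def simulate(n_images=1000, n_epochs=20, p=0.10, seed=0):
    """Count transformed samples and originals seen untouched at least once."""
    rng = random.Random(seed)
    seen_original = [False] * n_images  # has image i ever been served unchanged?
    n_transformed = 0
    for _ in range(n_epochs):
        for i in range(n_images):
            if rng.random() < p:
                n_transformed += 1       # this epoch the image is transformed
            else:
                seen_original[i] = True  # this epoch the image is served as-is
    return n_transformed, sum(seen_original)

n_transformed, n_seen_original = simulate()
print(n_transformed, "transformed samples out of", 1000 * 20)
print(n_seen_original, "of 1000 originals seen untouched at least once")
```

Under these assumptions an image would have to be transformed in all 20 epochs to never be seen in its original form, which happens with probability 0.1^20, so in practice no original is lost.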

If we do not want to do the transformations on the fly, we can create the transformed images and store them on disk before training. I guess we can also combine them with the original images (by choosing a certain "transformed/original" ratio) and train our model on the result. On the other hand, when the transformations are done on the fly, the randomness is handled by the fastai library.

diskandar · September 27, 2019, 8:14pm

Thank you, Farid. How do you know that? From looking at the code?

So it is assumed that the algorithm has a kind of ‘k-fold’ built in? Meaning that in every epoch it transforms a different subset of the dataset, so at the end (after k epochs) the model has seen all the data?

This means we should not use this with epoch=1 … Now I am intrigued to see the actual code.

farid · September 28, 2019, 2:50am

By following the fastai-v1 and fastai-v2-dev source code. Also by watching Jeremy’s Part 2 videos and the fastai v2 code walk-thru videos.

To understand what is going on with the transformations, you can check out the source file https://github.com/fastai/fastai/blob/master/fastai/basic_data.py and have a look at the DeviceDataLoader class, especially the function __iter__(self), which calls proc_batch(self,b:Tensor):

    def proc_batch(self,b:Tensor)->Tensor:
        "Process batch `b` of `TensorImage`."
        b = to_device(b, self.device)
        for f in listify(self.tfms): b = f(b)
        return b

    def __iter__(self):
        "Process and returns items from `DataLoader`."
        for b in self.dl: yield self.proc_batch(b)

You can see that in proc_batch we iterate through all the transforms in self.tfms and apply them to the batch one after another (b = f(b)), each transform receiving the output of the previous one.
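
The loop in proc_batch can be reproduced in isolation. The sketch below is not fastai code: a plain list stands in for the tensor batch, and `double` and `add_one` are made-up stand-ins for real transforms.

```python
from typing import Callable, List, Sequence

Batch = List[float]  # stand-in for a tensor batch

def proc_batch(batch: Batch, tfms: Sequence[Callable[[Batch], Batch]]) -> Batch:
    """Apply every transform in order; each one receives the previous output."""
    for f in tfms:
        batch = f(batch)
    return batch

def double(b: Batch) -> Batch:
    return [x * 2 for x in b]

def add_one(b: Batch) -> Batch:
    return [x + 1 for x in b]

result = proc_batch([1.0, 2.0, 3.0], [double, add_one])
print(result)  # [3.0, 5.0, 7.0]
```

Since each transform works on the output of the previous one, the order of the list matters: swapping the two functions here would give [4.0, 6.0, 8.0] instead.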

For your second question (K-fold): AFAIK the fastai library does not implement K-fold cross-validation. Both your training dataset and validation dataset are fixed, and training always uses the same training dataset.
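
For contrast, this is roughly what a K-fold split does, sketched in plain Python (not fastai code; `k_fold_indices` is a made-up helper name). The data is partitioned once into k fixed folds and each fold serves as the validation set in turn, which is a different thing from randomly transforming images in every epoch.

```python
def k_fold_indices(n_items, k):
    """Yield (train_idx, valid_idx) pairs; every item is validated exactly once."""
    indices = list(range(n_items))
    # Spread any remainder over the first folds so sizes differ by at most one.
    fold_sizes = [n_items // k + (1 if i < n_items % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        valid = indices[start:start + size]
        train = indices[:start] + indices[start + size:]
        yield train, valid
        start += size

folds = list(k_fold_indices(10, 5))
for train_idx, valid_idx in folds:
    print("train:", train_idx, "valid:", valid_idx)
```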

If you are interested in implementing K-fold in fastai, check out this post

As for epoch=1, training a model needs more epochs than that. The number of epochs depends on many factors, including your training loss, your validation loss, your accuracy, the cost of training, etc.

PS: if you are interested in studying the source code, you can browse it on GitHub or, even better, clone the fastai repo (v1 and/or v2) and read it in an editor such as Visual Studio Code, or use vim like Jeremy does.

Have a great week-end!

Alek · May 19, 2020, 11:59am

I think an easy solution would be to prepare the dataset twice. In the first epoch, do not perform any transformations, then save the results (learn.save('stage-1')). Purge everything (easiest: restart the kernel), prepare the dataset again, this time with transformations, and load the state from after the first epoch (learn.load('stage-1')).

AvidTeacher · July 26, 2020, 8:17am

Hi Farid,

What percentage of the data is transformed on the fly, given that we have not specified it explicitly?
( b = f(b) ) I understand it is done recursively, but for what percentage?

Thanking you,
Chetankumar

Diaga · August 1, 2020, 11:24am

The randomness is introduced through this Transform class, which almost all transform functions are built on:

https://github.com/fastai/fastai/blob/54a9e3cf4fd0fa11fc2453a5389cc9263f6f0d77/fastai/vision/image.py#L452-L477

class Transform():
    "Utility class for adding probability and wrapping support to transform `func`."
    _wrap=None
    order=0
    def __init__(self, func:Callable, order:Optional[int]=None):
        "Create a transform for `func` and assign it an priority `order`, attach to `Image` class."
        if order is not None: self.order=order
        self.func=func
        self.func.__name__ = func.__name__[1:] #To remove the _ that begins every transform function.
        functools.update_wrapper(self, self.func)
        self.func.__annotations__['return'] = Image
        self.params = copy(func.__annotations__)
        self.def_args = _get_default_args(func)
        setattr(Image, func.__name__,
                lambda x, *args, **kwargs: self.calc(x, *args, **kwargs))

    def __call__(self, *args:Any, p:float=1., is_random:bool=True, use_on_y:bool=True, **kwargs:Any)->Image:
        "Calc now if `args` passed; else create a transform called prob `p` if `random`."
        if args: return self.calc(*args, **kwargs)
        else: return RandTransform(self, kwargs=kwargs, is_random=is_random, use_on_y=use_on_y, p=p)

The probabilities for the image transforms returned by get_transforms are specified in its definition through the p_affine and p_lighting keyword arguments (both default to 0.75); the horizontal flip uses p=0.5.

https://github.com/fastai/fastai/blob/master/fastai/vision/transform.py#L308-L321

def get_transforms(do_flip:bool=True, flip_vert:bool=False, max_rotate:float=10., max_zoom:float=1.1,
                   max_lighting:float=0.2, max_warp:float=0.2, p_affine:float=0.75,
                   p_lighting:float=0.75, xtra_tfms:Optional[Collection[Transform]]=None)->Collection[Transform]:
    "Utility func to easily create a list of flip, rotate, `zoom`, warp, lighting transforms."
    res = [rand_crop()]
    if do_flip:    res.append(dihedral_affine() if flip_vert else flip_lr(p=0.5))
    if max_warp:   res.append(symmetric_warp(magnitude=(-max_warp,max_warp), p=p_affine))
    if max_rotate: res.append(rotate(degrees=(-max_rotate,max_rotate), p=p_affine))
    if max_zoom>1: res.append(rand_zoom(scale=(1.,max_zoom), p=p_affine))
    if max_lighting:
        res.append(brightness(change=(0.5*(1-max_lighting), 0.5*(1+max_lighting)), p=p_lighting))
        res.append(contrast(scale=(1-max_lighting, 1/(1-max_lighting)), p=p_lighting))
    #       train                   , valid
    return (res + listify(xtra_tfms), [crop_pad()])
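
These defaults give a rough answer to the "what percentage" question. The sketch below is plain Python, not fastai code, and rests on assumptions: each transform is taken to fire independently with its own probability p (my reading of how RandTransform works), rand_crop is left out, and a transform that fires may still draw a magnitude so small that the image barely changes. `DEFAULT_PROBS`, `prob_untouched` and `simulate_untouched` are made-up names.

```python
import random

# Probabilities read off the get_transforms defaults shown above.
DEFAULT_PROBS = {
    "flip_lr": 0.5,
    "symmetric_warp": 0.75,  # p_affine
    "rotate": 0.75,          # p_affine
    "rand_zoom": 0.75,       # p_affine
    "brightness": 0.75,      # p_lighting
    "contrast": 0.75,        # p_lighting
}

def prob_untouched(probs):
    """Probability that none of the transforms fires for a given image."""
    result = 1.0
    for p in probs.values():
        result *= 1.0 - p
    return result

def simulate_untouched(probs, n_images=200_000, seed=0):
    """Estimate the same probability by drawing one coin flip per transform."""
    rng = random.Random(seed)
    untouched = 0
    for _ in range(n_images):
        if all(rng.random() >= p for p in probs.values()):
            untouched += 1
    return untouched / n_images

exact = prob_untouched(DEFAULT_PROBS)
print(f"exact:     {exact:.6f}")
print(f"simulated: {simulate_untouched(DEFAULT_PROBS):.6f}")
```

Under these assumptions the chance that an image goes through with no transform firing at all is 0.5 x 0.25^5 = 1/2048, about 0.05%. So with the defaults practically every training image is altered in some way in every epoch, rather than a fixed 10% of the dataset.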