New coordinate transforms pipeline

prabu · July 29, 2018, 7:37am

Will elastic distortions also be done directly on the torch tensors (if so, how will the grid flow be generated in a device-agnostic way)? Will piecewise affine transforms of an image also be considered? Thanks for all the goodies so far and the goodies currently baking …

sgugger · July 29, 2018, 3:35pm

We’re running tests on a wide range of tasks to see whether our implementation is faster. In the few tests I did, torchvision was slightly slower than OpenCV.

sgugger · July 29, 2018, 3:38pm

We’ve not implemented that yet, but we’ll be looking at it. All the functions we use (mainly affine_grid and grid_sampler) work on both the CPU and the GPU, so even if it ends up running on one device in fastai_v1 (for now we’re mainly looking at the CPU), it’ll be easy to adapt it to the other.
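
As a rough illustration of that point, here is a minimal sketch of a rotation built from PyTorch's `F.affine_grid` and `F.grid_sample`. The `rotate` helper is hypothetical and is not the fastai_v1 code; it only shows that the same calls run unchanged on whichever device the image tensor lives on.

```python
import math

import torch
import torch.nn.functional as F


def rotate(img, degrees):
    """Rotate a batch of images (N, C, H, W) about the center, on img's own device."""
    theta = math.radians(degrees)
    # 2x3 affine matrix, created on the same device and dtype as the image.
    mat = torch.tensor(
        [[math.cos(theta), -math.sin(theta), 0.0],
         [math.sin(theta), math.cos(theta), 0.0]],
        device=img.device,
        dtype=img.dtype,
    )
    mat = mat.unsqueeze(0).repeat(img.size(0), 1, 1)  # one matrix per image
    # Sampling coordinates in [-1, 1] for every output pixel.
    grid = F.affine_grid(mat, list(img.shape), align_corners=False)
    # Bilinear lookup; coordinates outside [-1, 1] are filled with zeros.
    return F.grid_sample(img, grid, mode="bilinear",
                         padding_mode="zeros", align_corners=False)


# Pick the GPU when there is one; nothing else in the code changes.
device = "cuda" if torch.cuda.is_available() else "cpu"
img = torch.rand(1, 3, 32, 32, device=device)
out = rotate(img, 30)
```

The only device-specific line is the one that creates `img`; the matrix, the grid and the output all follow it.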

jeremy · July 31, 2018, 1:32am

It’s a little early to say anything definitive about speed, since Soumith has been kind enough to prioritise optimizing the things we need for performance in fastai. For instance, they just added a PR that speeds up grid_sample by 10x, and there’s more to come.

jeremy · July 31, 2018, 1:34am

Yes, that’s the plan. The grid flow generation will be done in a similar way to the current affine matrix generation (that is, it’ll be put on the same device as the image when it’s used).
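
To make the idea concrete, here is a hedged sketch of a flow grid that is generated on the same device as the image. The `elastic_grid` helper, its `magnitude` and `coarse` parameters, and the choice of upsampled random noise as the displacement field are all assumptions made for illustration; this is not the planned fastai implementation.

```python
import torch
import torch.nn.functional as F


def elastic_grid(img, magnitude=0.1, coarse=4):
    """Identity sampling grid plus a smooth random displacement, on img's device."""
    n, _, h, w = img.shape
    # Identity affine matrix -> plain sampling grid in [-1, 1], shape (n, h, w, 2).
    eye = torch.eye(2, 3, device=img.device, dtype=img.dtype)
    eye = eye.unsqueeze(0).repeat(n, 1, 1)
    grid = F.affine_grid(eye, [n, 1, h, w], align_corners=False)
    # Coarse random offsets in [-1, 1], upsampled into a smooth flow field.
    noise = torch.rand(n, 2, coarse, coarse,
                       device=img.device, dtype=img.dtype) * 2 - 1
    flow = F.interpolate(noise, size=(h, w), mode="bilinear",
                         align_corners=False)
    # Move the 2 offset channels last so they line up with the grid layout.
    return grid + magnitude * flow.permute(0, 2, 3, 1)


img = torch.rand(2, 3, 16, 16)
grid = elastic_grid(img)
warped = F.grid_sample(img, grid, padding_mode="reflection",
                       align_corners=False)
```

Every tensor is created with `device=img.device`, so moving the image to the GPU moves the whole flow generation with it.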

prabu · July 31, 2018, 2:37am

Thanks for the reply Sylvain and Jeremy.

ranakj · August 1, 2018, 4:38am

Thank you so much for sharing this post; it led me to stumble upon the fast.ai dev category as well. Great summary, too. What do you guys think about libraries such as:

https://github.com/mdbloice/Augmentor/blob/master/README.md

Augmentor is an image augmentation library in Python for machine learning. It aims to be a standalone library that is platform and framework independent, which is more convenient, allows for finer grained control over augmentation, and implements the most real-world relevant augmentation techniques. It employs a stochastic approach using building blocks that allow for operations to be pieced together in a pipeline.

## Installation

Augmentor is written in Python. A Julia version of the package is also being developed as a sister project and is available [here](https://github.com/Evizero/Augmentor.jl).

Install using `pip` from the command line:

```shell
pip install Augmentor
```

What other data augmentation techniques are you looking into implementing? Is there anywhere we can see a list of what’s next? I’m also a little curious about the testing process: do you augment -> train -> measure accuracy, loss, etc.? If someone else wants to test too, would that be the process?

sgugger · August 1, 2018, 1:28pm

For now we are creating the general pipeline with classic data augmentation techniques (probably everything Augmentor offers will be there in the end). Not all the transforms will necessarily be there in the first release, but we’re making it very easy for anyone to add a new transform.
As for the testing, we check that training on CIFAR-10, ImageNet, dogs and cats, etc. reaches the same accuracy as when we use data augmentation from torchvision/PIL/OpenCV.

lesscomfortable · August 2, 2018, 1:09pm

Hey, @sgugger thanks for the great explanation.

You state that:

I don’t quite understand this. Are you talking about the pixels that don’t fall exactly in the grid? Don’t we just crop these out in Step 2.5?

There must be something I am not getting right. Thanks!

sgugger · August 2, 2018, 1:19pm

Padding comes into play when, inside the zone of your image (decided by cropping), a pixel has to be sampled from coordinates that are out of the bounds of the input picture (so < -1 or > 1 with PyTorch conventions). We have to decide on a value for those pixels, since there is nothing inside the image to take it from, and the different ways of choosing it are the ones I explained.
To see what padding does, rotate a square picture by 30 degrees ;-).
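
The rotation experiment can be checked with plain arithmetic. The sketch below (helper names are made up for illustration) rotates the four corners of the [-1, 1] square by 30 degrees and lists the ones that land outside the input picture; those are the locations that need a padding value. In PyTorch's `grid_sample`, that choice is the `padding_mode` argument (`'zeros'`, `'border'` or `'reflection'`).

```python
import math


def rotated_corners(degrees):
    """Rotate the corners of the [-1, 1] x [-1, 1] square about the origin."""
    t = math.radians(degrees)
    corners = [(-1.0, -1.0), (1.0, -1.0), (1.0, 1.0), (-1.0, 1.0)]
    return [
        (x * math.cos(t) - y * math.sin(t), x * math.sin(t) + y * math.cos(t))
        for x, y in corners
    ]


def out_of_bounds(points):
    """Keep the points with a coordinate below -1 or above 1."""
    return [p for p in points if max(abs(p[0]), abs(p[1])) > 1.0]


pts = rotated_corners(30)
oob = out_of_bounds(pts)
for x, y in oob:
    print(f"({x:+.3f}, {y:+.3f}) is outside the input picture")
```

All four corners end up with one coordinate of magnitude about 1.366, so each corner of the rotated output has to be filled by padding.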

lesscomfortable · August 8, 2018, 8:31pm

@jeremy @sgugger

Hey guys,

Do you think we should include this in the dev_nb as prose? I know the idea was to use it in documentation but did you think of including it in the dev notebooks? (sorry I don’t know if dev_notebooks are the only source of documentation).

sgugger · August 9, 2018, 6:37am

It’ll be included in the documentation (which is going to be notebooks, but different from the dev notebooks). As for the dev notebooks, Jeremy is going to use them as support for the second part of the course, so I’m guessing that he’ll explain what’s in this post during one of the lessons.
Maybe a summary can be included in the notebook, but I don’t think the whole thing should go in.

lesscomfortable · August 9, 2018, 12:01pm

Yeah that’s what I thought. Thanks!

TheShadow29 · September 13, 2018, 9:04pm

@sgugger could you point me to where the transformations are being done in fastai_v1? I can see them in places in the notebooks, but not in the fastai folder. Thanks

sgugger · September 13, 2018, 9:23pm

They’re not in the fastai module yet, just in the notebooks.