Lesson 11 discussion and wiki

With a resize zoom transform, what if you lose the object of interest in the process, effectively changing the label?

I'm not sure I understand what you mean. There are different classes for different kinds of transformation (TfmPixel, TfmCrop), and those classes have an order between them that is fixed in fastai? What about different instances of TfmPixel transforms, how are they ordered among themselves?

If that's such a common transformation, why not bake some of these transformations into the network architecture?

Doesn't cropping the fish out of the image (his top-left example) invalidate the appropriateness of the "tench" label?

You have to be careful not to apply too much zoom. Yet this RandomResizeCrop is the most commonly used technique on ImageNet, with great success, and it often loses the object of interest.
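For reference, here is a minimal sketch of this kind of transform using torchvision's RandomResizedCrop (the image path is hypothetical). The default scale range can keep as little as 8% of the image area, which is exactly why the object of interest can end up outside the crop:

```python
from PIL import Image
from torchvision import transforms

# Default parameters: crop an area covering 8%-100% of the image,
# with an aspect ratio between 3/4 and 4/3, then resize to 224x224.
tfm = transforms.RandomResizedCrop(size=224, scale=(0.08, 1.0), ratio=(3/4, 4/3))

img = Image.open("tench.jpg")   # hypothetical image file
cropped = tfm(img)              # the fish may or may not survive this crop
```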

The model ends up learning that a middle-aged person with a smile must be holding a fish.

They all have the same order, because we never had the need to take care of that. I'm not sure what example you have in mind of a kind of data augmentation that needs to run before or after another.
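For anyone curious, here is an illustrative sketch of the _order idea from the course notebooks (the class names are made up for this example). Transforms get sorted by _order, and because Python's sort is stable, transforms that share the same _order simply keep the order you passed them in:

```python
class Transform:
    _order = 0

class MakeRGB(Transform):
    _order = 0    # pixel-level prep, runs first

class ResizeFixed(Transform):
    _order = 10   # resizing runs after that

class ToByteTensor(Transform):
    _order = 20   # conversion to tensor runs last

def sort_tfms(tfms):
    # sorted() is stable: transforms with equal _order keep their given order
    return sorted(tfms, key=lambda t: t._order)

tfms = sort_tfms([ToByteTensor(), MakeRGB(), ResizeFixed()])
print([type(t).__name__ for t in tfms])
# ['MakeRGB', 'ResizeFixed', 'ToByteTensor']
```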

Typically the operations you want to bake into your architecture are ones for which you want to compute gradients. So there's no point in spending time keeping track of gradients we might not need.

Another reason is that you might not want to apply these transformations at test/inference time. You might want to train with the augmentations, but not necessarily use them when you've deployed your model to production.

Another reason is that the transformations might depend on the domain. We can use a ResNet on both ImageNet and MNIST. However, while we can flip ImageNet images horizontally, we probably don't want to flip the digits in MNIST.
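A quick sketch of that point using torchvision (not the fastai API from the lesson): the random augmentation lives in the training pipeline only, and for something like MNIST you would simply leave the flip out:

```python
from torchvision import transforms

train_tfms = transforms.Compose([
    transforms.RandomResizedCrop(224),
    transforms.RandomHorizontalFlip(),   # fine for ImageNet, wrong for MNIST digits
    transforms.ToTensor(),
])

valid_tfms = transforms.Compose([        # no random augmentation at inference time
    transforms.Resize(256),
    transforms.CenterCrop(224),
    transforms.ToTensor(),
])
```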

Didn't know about torch.solve :smile:

It's two weeks old, so it's understandable.
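For anyone who wants to try it: torch.solve(B, A) solved the batched linear system A X = B and returned the solution together with the LU factorization. It has since been deprecated in favour of torch.linalg.solve, which is what this little sketch uses:

```python
import torch

A = torch.randn(3, 3)
B = torch.randn(3, 2)

# torch.solve(B, A) returned (solution, LU); the current spelling is:
X = torch.linalg.solve(A, B)                 # solves A @ X = B
print(torch.allclose(A @ X, B, atol=1e-5))   # True, up to numerical error
```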

Couldn't it also be used to clean the data? I'm thinking about the Nature Conservancy Fisheries Monitoring Kaggle competition.

Right, not at inference. But I'm just kind of wondering out loud whether the intuition behind doing this data augmentation is to make your training data go further, sort of to artificially "take more photos" (because "more data always wins").

Has someone written a tutorial or example of using the TensorBoard integration? I've found the code but very little documentation of this functionality.

What about the black pixels in these transforms where you change the perspective, or tilt the picture in some way so that it no longer fills up the original rectangle?

The motivation has been explained at length during part 1; that's why Jeremy skipped it tonight. It's to artificially have more training data, yes.

We usually use reflection padding to fill the black pixels.
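To make that concrete, here is a small sketch with torch.nn.functional.grid_sample, whose padding_mode argument controls how pixels sampled from outside the original image get filled: 'zeros' gives the black borders, 'reflection' mirrors the image content into them instead:

```python
import torch
import torch.nn.functional as F

img = torch.rand(1, 3, 64, 64)               # dummy batch of one RGB image

# A zoomed-out sampling grid, so some coordinates fall outside the image
# and the choice of padding becomes visible.
theta = torch.tensor([[[1.3, 0.0, 0.0],
                       [0.0, 1.3, 0.0]]])
grid = F.affine_grid(theta, img.shape, align_corners=False)

black_borders = F.grid_sample(img, grid, padding_mode='zeros', align_corners=False)
reflected     = F.grid_sample(img, grid, padding_mode='reflection', align_corners=False)
```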

Is perspective-shift augmentation fast enough given the ImageNet constraints discussed above?

If I want to use a particular transformation offered by OpenCV, how do I go about it? Is it better to convert that particular transformation to PIL in all cases?

Right, I don't have anything specific in mind for transforms. It's just that we've been using _order a few times now (for callbacks too, for example) and I was wondering how transparent and usable it is. But if it's implemented differently in fastai from what's shown here, and/or the order actually doesn't matter that much, I may be thinking too much about it :wink:

No. The person who added the functionality doesn't have the time to document it right now, so if you want to volunteer to do that, it would be much appreciated.