Fastai_v1, adding features


#1

This topic is for suggesting any new features that should be added to the library. Development is limited to bug fixes for now, as we are focused on the documentation, but ideas are definitely welcome!

Here are a few things we didn’t get a chance to implement yet:

  • save/load the optimizer state on top of the model
  • data augmentation when the target is a list of points
  • TTA for segmentation or bounding boxes
  • orders for callbacks, to solve this in particular
  • GAN implementation

(Arka Sadhu) #2

I think almost everything can be handled via the wonderful callback API. I am not sure if this has an easy solution with callbacks, but how do we train two models simultaneously, as in GANs?

A not-so-elegant solution I have found is to have one model containing both the discriminator and the generator, with some property deciding which one to execute, and to use callbacks to change that property. I would love to know if there is something simpler.
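The switch idea above could be sketched like this (a toy sketch, not fastai API; all class and attribute names here are made up for illustration):

```python
import torch
import torch.nn as nn

class GANPair(nn.Module):
    """Hypothetical wrapper holding a generator and a discriminator.

    A `gen_mode` flag decides which sub-model `forward` runs; a callback
    would flip the flag between training phases.
    """
    def __init__(self, generator, discriminator):
        super().__init__()
        self.generator = generator
        self.discriminator = discriminator
        self.gen_mode = True  # start by training the generator

    def switch(self, gen_mode=None):
        # Toggle (or explicitly set) which sub-model is active.
        self.gen_mode = (not self.gen_mode) if gen_mode is None else gen_mode

    def forward(self, x):
        return self.generator(x) if self.gen_mode else self.discriminator(x)

# Tiny toy sub-models, just to exercise the wrapper
gen = nn.Linear(2, 4)
disc = nn.Linear(4, 1)
model = GANPair(gen, disc)

noise = torch.randn(3, 2)
fake = model(noise)    # generator pass -> shape (3, 4)
model.switch()         # a callback would call this between phases
score = model(fake)    # discriminator pass -> shape (3, 1)
```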


(Jeremy Howard) #3

I hope to add GAN support to fastai by the time we get to the mid-point of the course, including CycleGAN. Ideally we’ll add it as a loss function, although I haven’t fully thought it through yet.


(Arka Sadhu) #4

I think we should also have a data class which supports something like pascal multi (I couldn’t find such a notebook in dev_nb). To be precise, it should take a csv file with
image_id bbox cls for each bounding box (multiple bboxes go on separate rows).

In the previous version this was done via ConcatLibDataset; however, that requires pre-processing of the csv files, and I believe image_id bbox cls is a standard format, so it would be nice to support it.
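Grouping that format back into one-entry-per-image is straightforward; a minimal stdlib-only sketch (the sample data and the space-separated bbox encoding are assumptions, not a fastai convention):

```python
import csv
import io
from collections import defaultdict

# Hypothetical CSV in the `image_id bbox cls` format, one row per box;
# here each bbox is stored as four space-separated numbers.
raw = """image_id,bbox,cls
img_001.jpg,10 20 50 60,person
img_001.jpg,30 40 80 90,dog
img_002.jpg,5 5 25 25,cat
"""

def parse_bbox_csv(f):
    """Group bounding boxes and classes by image, so one image maps to
    the list of all its annotations (multiple rows -> one entry)."""
    annotations = defaultdict(lambda: {"bboxes": [], "classes": []})
    for row in csv.DictReader(f):
        bbox = [float(v) for v in row["bbox"].split()]
        annotations[row["image_id"]]["bboxes"].append(bbox)
        annotations[row["image_id"]]["classes"].append(row["cls"])
    return dict(annotations)

anns = parse_bbox_csv(io.StringIO(raw))
# img_001.jpg now has two boxes, img_002.jpg one
```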


(Jeremy Howard) #5

We’ll be adding that after Tuesday. Some initial steps for it are already in place.


(Arka Sadhu) #6

Not exactly related to fastai, but to fastprogress. It’s not a very crucial thing, but currently the output looks like this:
[screenshot of fastprogress output during training, with values not aligned under their column headers]

It would be great if the values were in a more tabular format, e.g. with the value of the training loss directly below the “training loss” header. It is slightly tough on the eye to decipher the values as-is.

Though at the end it does become tabular:
[screenshot of the final tabular output]


#7

I’ve tried my best, but the widget is in HTML, so the spaces are all messed up. Since it’s only during training and the final output is clean, we decided it wasn’t worth spending more time on for now.


(Arka Sadhu) #8

Fair point.


(Matt Trent) #9

Can you wrap each line in a <pre> or <code> block?


#10

<pre> didn’t work. I’ll try <code> to see if it does, thanks for the suggestion!


(Arka Sadhu) #11

Another interesting addition could be a good debugger. I see there is a debugger defined here: https://github.com/fastai/fastai_pytorch/blob/feae73216b36e27effffd649664c533dc628385f/fastai/layers.py#L69. However, it seems to just call set_trace in the forward pass.

This post https://medium.com/@keeper6928/how-to-unit-test-machine-learning-code-57cf6fd81765 outlines how to unit test deep learning modules. While many of the problems it describes would be easy to find with a good IDE, quite a few bugs still go unnoticed, for example vanishing/exploding gradients, models not being initialized properly, or the loss becoming negative.
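One of the checks in that spirit, catching vanishing gradients, can be written as a plain unit test. A hedged sketch (the helper name and thresholds are made up, not fastai API):

```python
import torch
import torch.nn as nn

torch.manual_seed(0)  # make the toy example deterministic

def assert_gradients_flow(model, inp, min_grad=1e-8):
    """Run one forward/backward pass and fail if any trainable parameter
    received no gradient, or an effectively zero one."""
    model.zero_grad()
    model(inp).sum().backward()
    for name, p in model.named_parameters():
        if not p.requires_grad:
            continue
        assert p.grad is not None, f"{name}: no gradient at all"
        assert p.grad.abs().max() > min_grad, f"{name}: vanishing gradient"

# A healthy model passes the check silently
healthy = nn.Sequential(nn.Linear(4, 8), nn.Tanh(), nn.Linear(8, 1))
assert_gradients_flow(healthy, torch.randn(2, 4))
```

The same helper flags a model whose activations are all dead, since the upstream gradient is then exactly zero everywhere.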


(Pawan Sasanka) #12

Does the fastai library have place for VAEs/Autoencoders and autoregressive model implementations?


(Jeremy Howard) #13

Autoencoders are just a custom head. Nothing to support variational models yet. For autoregressive models you can use an RNN.
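The “custom head” framing might look like this (a toy sketch; all layer sizes are made up, and a real setup would use a pretrained body):

```python
import torch
import torch.nn as nn

# The body encodes; the decoder is just a head bolted on top.
body = nn.Sequential(nn.Linear(784, 128), nn.ReLU(), nn.Linear(128, 32))   # encoder
head = nn.Sequential(nn.Linear(32, 128), nn.ReLU(), nn.Linear(128, 784))   # decoder head
autoencoder = nn.Sequential(body, head)

x = torch.randn(4, 784)
recon = autoencoder(x)  # reconstruction has the input's shape
```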


#14

Thank you for the wonderful courses, Jeremy, Rachel, and Sylvain!

I am very interested in applying what I have learned from the fast.ai courses to medical imaging. In such applications I often face inputs with an arbitrary number of channels: for example, 1-channel grayscale images, N-channel 3D data sets, M-channel multi-parametric maps, K-channel multi-contrast images, or L-channel real-part and complex-part data acquired from multiple receivers (where N, M, K, L, … are arbitrary integers such as 1, 2, 3, 4, 5, … up to 128 or even 256).

I was wondering whether the fast.ai library could support input data with an arbitrary number of channels?

Thanks a lot!


(Jeremy Howard) #15

Sure. As you try it out, let us know if you find places where it’s making assumptions about # channels. (Other than model definitions, of course).
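For the model-definition part, one common workaround is to swap the first conv layer for one that accepts N input channels. A sketch (the helper is hypothetical and assumes the conv to replace is the model’s first layer):

```python
import torch
import torch.nn as nn

def replace_first_conv(model, n_channels):
    """Swap a Sequential model's first Conv2d for one accepting
    `n_channels` inputs, keeping the other conv hyperparameters.
    Note: the new layer is freshly initialized, not pretrained."""
    old = model[0]
    model[0] = nn.Conv2d(n_channels, old.out_channels,
                         kernel_size=old.kernel_size, stride=old.stride,
                         padding=old.padding, bias=old.bias is not None)
    return model

# Toy 3-channel model adapted to 7-channel input (e.g. multi-contrast data)
model = nn.Sequential(nn.Conv2d(3, 16, 3, padding=1), nn.ReLU())
model = replace_first_conv(model, 7)
out = model(torch.randn(2, 7, 8, 8))  # shape (2, 16, 8, 8)
```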


#16

Yes, I will, Jeremy.


(Caleb Everett) #17

What were your initial thoughts on how to implement TTA for segmentation?


(Fred Guth) #18

To implement TTA for segmentation, you need to keep the transformation parameters and have a way of applying the inverse of each transformation. There may be transformations that are not invertible.


(Caleb Everett) #19

So that would entail adding a separate set of invertible-only transforms for the test dl (instead of reusing the validation ones), modifying the test dl to capture the transform parameters, and then having the TTA function apply the inverses to the predictions?

Here was another thread on the topic:

And also, not all of them would need to be inverted, only those that change pixel locations. Crops probably wouldn’t work.
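For an invertible transform like a horizontal flip, the whole scheme fits in a few lines; a minimal sketch (the function name is made up, and a real version would loop over many transforms):

```python
import torch

def tta_flip_segmentation(model, x):
    """Sketch of invertible TTA for segmentation: predict on the image
    and on its horizontal flip, undo the flip on the second prediction
    (per-pixel predictions must be mapped back), then average."""
    pred = model(x)
    pred_flipped = model(torch.flip(x, dims=[-1]))
    pred_unflipped = torch.flip(pred_flipped, dims=[-1])  # invert the transform
    return (pred + pred_unflipped) / 2

# With an identity "model", the inversion is exact and TTA returns the input
x = torch.arange(16.0).reshape(1, 1, 4, 4)
seg = tta_flip_segmentation(lambda t: t, x)
```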


(Fred Guth) #20

Maybe there is an easier way, but that was what I thought was needed when I had this problem. If you want, we could work together on trying to solve this.

Here is another discussion on the subject: https://github.com/fastai/fastai/issues/646