Deep Learning at UnB (Brasília) - Part 1 - Lesson 3

[ <<< Lesson 2 | Lesson 4 >>> ]

Lesson 3 - Multi-label, Segmentation, Image Regression, and More… (06/11/2019 - UnB - Brasília)

This topic lets the members of the UnB (Brasília) AI Group study lesson 3 (part 1) of the fastai course collectively (in in-person and online meetings), but in an open way, so that the questions, answers, and resources published here also help all Portuguese-speaking readers interested in DL.


Lesson resources

Other resources

Further reading

For next week

Agenda (06/11/2019 - UnB - Brasília)

  1. [ 10mn ] Ways to optimize online group work
  2. [ 15mn ] Methodology for creating and presenting a viable AI project (examples with the projects of @izzywho's team and of @thiagodma - Andrew Ng's course: AI For Everyone)
  3. [ 10mn ] Organization of the December conference (date, time, venue, general coordinator, logistics coordinator, communication coordinator, project mentor = Pierre)
  4. [ 5mn ] Group website: what is its goal? (examples from 2 sites: IA.BSB and ensina.ai)
  5. [ 15mn ] Review of GCP/Colab usage by @thiagodma
  6. New posts since the previous class:
  7. [ 10mn ] Key points from the previous class
  8. [ 1h20mn ] Lesson 3 (see “Video timeline”)
  9. [ 15mn ] Hands-on workshop
  10. [ 0mn ] Class photos :slight_smile:

Video timeline

  • Examples of web apps people have built during the week [3:36]
  • Multi-label classification with Planet Amazon dataset [9:51]
  • Downloading the data through the Kaggle API [11:02]
  • Multiclassification [14:49]
  • Dataset (PyTorch) [18:30]
  • DataLoader (PyTorch) [20:37]
  • Data block API examples [23:56]
  • Planet [26:01]
  • CAMVID [26:38]
  • COCO [27:41]
  • Creating satellite image data bunch [29:35]
  • Creating multi-label classifier [35:59]
  • Python3 partial [39:17]
  • How to choose good learning rates [48:50]
  • Making the model better (Transfer Learning) [50:30]
  • Segmentation example: CamVid [56:31]
  • Image Segmentation [1:03:05]
  • Creating a data bunch [1:05:53]
  • Training [1:09:00]
  • U-Net [1:16:24]
  • A little more about learn.recorder [1:18:54]
  • What you are looking for in plot_losses [1:25:01]
  • Go big [1:26:16]
  • Another trick: Mixed precision training [1:30:59]
  • Regression with BIWI head pose dataset [1:34:03]
  • Create a regression model [1:38:59]
  • IMDB [1:41:07]
  • Universal approximation theorem [1:52:27]
  • Wrapping up [2:01:19]

Resources

Exercises for the next class

  • Multi-label classification
  • Segmentation (pixel classification)
  • Regression with images

From “The data block API” (fastai docs)

The data block API lets you customize the creation of a DataBunch by isolating the underlying parts of that process in separate blocks, mainly:

  1. Where are the inputs and how to create them?
  2. How to split the data into training and validation sets?
  3. How to label the inputs?
  4. What transforms to apply?
  5. How to add a test set?
  6. How to wrap in dataloaders and create the DataBunch?

Each of these may be addressed with a specific block designed for your unique setup. Your inputs might be in a folder, a csv file, or a dataframe. You may want to split them randomly, by certain indices, or depending on the folder they are in. Your labels can be in your csv file or your dataframe, but they may also come from folders or from a specific function of the input. You may choose to add data augmentation or not. A test set is optional too. Finally, you have to set the arguments to put the data together in a DataBunch (batch size, collate function…).

The data block API is called as such because you can mix and match each one of those blocks with the others, allowing total flexibility to create your customized DataBunch for training, validation and testing. The factory methods of the various DataBunch classes are great for beginners, but you can’t always make your data fit in the tracks they require.
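
As a concrete illustration of those steps, here is a minimal sketch using fastai v1, assuming a Planet-style dataset with a train folder of images and a labels.csv file of space-delimited tags (the path and file names are hypothetical; step 5, the optional test set, is skipped):

```python
from fastai.vision import *

path = Path('data/planet')  # hypothetical dataset location

data = (ImageList.from_csv(path, 'labels.csv', folder='train', suffix='.jpg')  # 1. inputs
        .split_by_rand_pct(0.2)                 # 2. random 80/20 train/validation split
        .label_from_df(label_delim=' ')         # 3. multi-label targets from the csv
        .transform(get_transforms(), size=128)  # 4. data augmentation and resizing
        .databunch(bs=64)                       # 6. wrap in dataloaders -> DataBunch
        .normalize(imagenet_stats))
```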


From “Using the fastai Data Block API”

In fastai the data-containing object that we need to feed to a neural network is called a DataBunch. This is called a ‘bunch’ because it bunches together several PyTorch classes into one.

In PyTorch there are two primary data objects:

  • the DataSet (which contains all of the data items together with their associated label(s)),
  • and the DataLoader (which gives chunks of the items in the DataSet to the model in ‘batches’).

For a typical supervised learning problem we will want a ‘training set’ and a ‘validation set’, with a separate DataSet and DataLoader for each (as well as an optional ‘test set’, which we will ignore here for simplicity). All of these are bundled up into the fastai DataBunch!

1/3 - PyTorch Dataset

(from Dataset (PyTorch) [18:30])

Although PyTorch says “in order to tell PyTorch about your data, you have to create a dataset”, it doesn’t really do anything to help you create the dataset. It just defines what the dataset needs to do. In other words, the starting point for your data is something where you can say:

  • What is the third item of data in my dataset (that’s what __getitem__ does)
  • How big is my dataset (that’s what __len__ does)
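
A minimal sketch of such a Dataset in plain PyTorch (the class and variable names are just illustrative):

```python
import torch
from torch.utils.data import Dataset

class MyDataset(Dataset):
    def __init__(self, items, labels):
        self.items, self.labels = items, labels

    def __len__(self):
        # "How big is my dataset?"
        return len(self.items)

    def __getitem__(self, i):
        # "What is the i-th item of data in my dataset?"
        return self.items[i], self.labels[i]

ds = MyDataset(torch.randn(100, 3, 32, 32), torch.randint(0, 2, (100,)))
x, y = ds[2]  # the third item
```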

2/3 - PyTorch DataLoader

(from DataLoader (PyTorch) [20:37])

Now a dataset is not enough to train a model. The first thing we have to do, if you think back to last week’s gradient descent tutorial, is grab a few images/items at a time so that our GPU can work in parallel. Remember we do this thing called a “mini-batch”? A mini-batch is a few items that we present to the model at a time, so that it can train on them in parallel. To create a mini-batch, we use another PyTorch class called a DataLoader.
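
Continuing the sketch above, wrapping the Dataset in a DataLoader takes one line:

```python
from torch.utils.data import DataLoader

dl = DataLoader(ds, batch_size=16, shuffle=True)  # mini-batches of 16 items
xb, yb = next(iter(dl))
print(xb.shape, yb.shape)  # torch.Size([16, 3, 32, 32]) torch.Size([16])
```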

3/3 - DataBunch

(from DataBunch (fastai) [21:59])

It still isn’t enough to train a model, because we’ve got no way to validate the model. If all we have is a training set, then we have no way to know how we’re doing because we need a separate set of held out data, a validation set, to see how we’re getting along.

For that we use a fastai class called a DataBunch. A DataBunch is something which binds together a training data loader (train_dl) and a validation data loader (valid_dl).
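
A minimal sketch in fastai v1, assuming two Dataset objects built with the hypothetical MyDataset class above:

```python
import torch
from fastai.basic_data import DataBunch

# Hypothetical train/validation splits built with the MyDataset class above.
train_ds = MyDataset(torch.randn(80, 3, 32, 32), torch.randint(0, 2, (80,)))
valid_ds = MyDataset(torch.randn(20, 3, 32, 32), torch.randint(0, 2, (20,)))

# DataBunch.create wraps each Dataset in a DataLoader and bundles them
# together as data.train_dl and data.valid_dl.
data = DataBunch.create(train_ds, valid_ds, bs=16)
```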

Looking for datasets to train your Deep Learning models?
Fast.ai Datasets!

Want to understand better what the learn.fit_one_cycle() function that trains a Deep Learning model actually does? Read “A little more about learn.recorder” :slight_smile:
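
For instance, a minimal sketch in fastai v1, assuming data is a DataBunch of labeled images like the ones built above:

```python
from fastai.vision import *

learn = cnn_learner(data, models.resnet34, metrics=accuracy)
learn.fit_one_cycle(4, max_lr=1e-3)  # train with the 1cycle policy
learn.recorder.plot_lr()             # learning rate over the training run
learn.recorder.plot_losses()         # training and validation losses
```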

Wrapping up lesson 3 [2:01:19]

What have we looked at today? We started out by saying it’s really easy now to create web apps. We’ve got starter kits for you that show you how to create web apps, and people have created some really cool web apps using what we’ve learned so far, which is single-label classification.

But the cool thing is that the exact same steps we used for single-label classification can also be used for:

  • Multi-label classification such as in the planet dataset.
  • Image segmentation.
  • Any kind of image regression.
  • NLP classification.
  • and a lot more.

In each case, all we’re actually doing is:

  • Gradient descent
  • Non-linearity
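
As a minimal sketch of that recipe in plain PyTorch (the network shape and data are just illustrative), a model is little more than linear layers, a non-linearity, and a gradient descent loop:

```python
import torch
import torch.nn as nn

# Linear layer -> non-linearity -> linear layer.
model = nn.Sequential(nn.Linear(10, 50), nn.ReLU(), nn.Linear(50, 1))
opt = torch.optim.SGD(model.parameters(), lr=0.1)
loss_fn = nn.MSELoss()

x, y = torch.randn(64, 10), torch.randn(64, 1)  # random stand-in data
for _ in range(100):                             # gradient descent
    loss = loss_fn(model(x), y)
    opt.zero_grad()
    loss.backward()
    opt.step()
```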

The universal approximation theorem tells us that this combination lets us approximate any given function arbitrarily accurately, including functions such as:

  • Converting a spoken waveform into the thing the person was saying.
  • Converting a sentence in Japanese to a sentence in English.
  • Converting a picture of a dog into the word dog.

These are all mathematical functions that we can learn using this approach.

Understanding Learning Rates and How It Improves Performance in Deep Learning

Test your understanding of how neural networks work and how they are trained with exam 1 of the Master’s/PhD Program in Computer Science (Deep Neural Networks course - 2019/2) of the Instituto de Informática at the Universidade Federal de Goiás (UFG).
(cc Professor Anderson da Silva Soares)

Photos of the lesson 3 class :slight_smile:




Has anyone had this problem?


The solutions I’m finding are all for people who built the regressor in PyTorch, so to replicate them I would have to change the fastai source code.

Good morning @allansouza. Without seeing your notebook, it is hard to help you.
According to the error message, your dataset’s labels need to have scalar type long, not float.
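
A minimal illustration of that fix (the tensor name here is hypothetical):

```python
import torch

# Loss functions such as nn.CrossEntropyLoss expect integer class labels
# of scalar type long, not float.
labels = torch.tensor([0., 1., 1., 0.])  # float labels trigger the error
labels = labels.long()                   # cast to scalar type long
```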