Deep Learning Brasília - Review (lessons 1, 2, 3 and 4)


(Pierre Guillou) #1

<<< Post: Lesson 4 | Post: Lesson 5 >>>


Review

A review of the content covered so far, both theoretical and practical, with presentations by @pierreguillou, @saulberardo and @thgomes.

Deep Learning toolbox

Exercise 1

  • [UPDATE 21/04/18] Use this Jupyter notebook: https://github.com/piegu/fastai-projects/blob/master/lesson1-quick.ipynb
  • Goal: write the code without copy/paste
  • How to get it? In a terminal on your GPU machine (local or online):
    wget --no-check-certificate https://raw.githubusercontent.com/piegu/fastai-projects/master/lesson1-quick.ipynb
    or
    git clone https://github.com/piegu/fastai-projects.git

Exercise 2

  • [UPDATE 21/04/18] Use this Jupyter notebook: https://github.com/piegu/fastai-projects/blob/master/lesson1-DogBreed.ipynb
  • Goal: understand the code
  • How to get it? In a terminal on your GPU machine (local or online):
    wget --no-check-certificate https://raw.githubusercontent.com/piegu/fastai-projects/master/lesson1-DogBreed.ipynb
    or
    git clone https://github.com/piegu/fastai-projects.git

Key points

Source: Wiki thread: Intro workshop

  1. Terminal (an interface in which you can type and execute text-based commands)
  2. GPU
  3. Git
  4. Python
  5. Python environment
  6. Jupyter notebook (1h45m54s: video by @jeremy on how to use a Jupyter Notebook + Jupyter Notebook Commands & Shortcuts)
  7. Our group
  8. Deep Learning (What you need to do deep learning)
  9. Pytorch (A practitioner’s guide to PyTorch)
  10. Kaggle (script kaggle-cli to download/upload images)

1) Terminal

2) GPU (Graphic Processing Units)

GPU local

GPU online

  • NVIDIA GPUs use CUDA, a GPU programming platform used by most DL libraries (TensorFlow, PyTorch…).
  • PyTorch installs CUDA automatically.
  • Crestle
  • Paperspace
  • Amazon Web Services (AWS)
  • Clouderizer + Google Colab
Paperspace
  • Paperspace setup
  • Use the fastai template, choose P4000 + Public IP + promo code FASTAI15 or FASTAI3BDG ($15 credit) + Auto-shutdown (1h)
  • Launch + use the default password from the Paperspace email
  • Change the password in a terminal with the command: passwd
  • Update conda with the command: conda update --all
  • Update the fastai files with the 2 commands: cd fastai + git pull
  • Update the fastai library with the command (in the fastai folder): conda env update
  • Don’t forget to switch off your Paperspace machine!!! Use the command: sudo shutdown -h now
Clouderizer + Google Colab

3) Git

  • Git is a version control system that lets you work on a project with checkpoint backups.
  • GitHub is a Git repository hosting service that lets you share your project (GitHub guide).
  • Main commands :
    git clone (ex: git clone https://github.com/fastai/fastai.git)
    git pull

4) Python

5) Setup a python environment

  • Anaconda (https://anaconda.org/): global environment to run Python scripts within Jupyter notebooks
    – Update the conda environment in a terminal: conda update --all
  • Why? To practice Python 3.6, pandas, numpy, Jupyter notebooks… on your computer.
  • How to create a virtual environment? In a terminal:
    conda create -n envname python=3.6
    source activate envname (to start the virtual environment)
    source deactivate (to stop the virtual environment)
  • Update a virtual environment:
    – Go to the folder (cd ...)
    – Type in a terminal: conda env update

6) Jupyter Notebook

7) Fastai

8) O nosso grupo

9) Deep Learning

What is Deep Learning? A kind of Machine Learning.

The Universal Approximation Theorem & Examples using Deep Learning

  1. An infinitely flexible function that can solve any problem. This is the Neural Network: a stack of linear and non-linear layers that can approximate any given function if we have enough data to train it (Universal Approximation Theorem). In practice we use networks with multiple hidden layers, which are needed to get better accuracy.
  2. A way to set the parameters of this general-purpose function for each use. This is Gradient Descent: the learning method for the parameters of our DL algorithm (after each epoch, we apply it to the loss function to improve our parameter values by searching for a minimum of the loss, often a local rather than the global minimum).
  3. Fast and scalable computation. This is the GPU: GPUs are roughly 10 times faster than CPUs, and cheaper. GPUs are necessary to train our DL models in a reasonable time.
    – DL use cases: generating automatic answers to emails, Skype Translator, Semantic Style Transfer, cancer detection in medical images.
    – A lot of opportunities to solve problems with DL.
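
The gradient-descent idea in point 2 can be sketched in a few lines of plain Python (a toy one-parameter example, not the fastai code): we minimize f(x) = (x - 3)², whose minimum is at x = 3, by repeatedly stepping opposite the derivative.

```python
# Toy gradient descent: minimize f(x) = (x - 3)^2, minimum at x = 3.
def f_prime(x):
    return 2 * (x - 3)  # derivative of the loss

def gradient_descent(x0, lr=0.1, epochs=100):
    x = x0
    for _ in range(epochs):
        x -= lr * f_prime(x)  # step in the direction opposite the gradient
    return x

print(gradient_descent(x0=0.0))  # converges very close to 3.0
```

With a well-chosen learning rate the error shrinks by a constant factor each step, which is why a few dozen epochs suffice here.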

More examples using Deep Learning & CNN

Non linearity, Gradient Descent & Learning rate

  • Link: https://www.youtube.com/embed/IPBSB1HLNLo?hl=en_US&cc_lang_pref=en_US&cc_load_policy=1&autoplay=1&start=3733&end=4102
  • Duration: 6 min 9s (01:02:13 to 01:08:22)
  • Topics:
    – Adding a non-linear layer to our model, sigmoid or ReLU (rectified linear unit): the key point is that this allows us to build any kind of function and so to solve any kind of problem.
    – SGD (Stochastic Gradient Descent): to find the minimum of a loss function, we move from a point on the curve in the direction opposite to the gradient (derivative).
    – But we need to take a small step (though not too small…): this is the learning rate.
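
The two points above can be illustrated with a toy sketch in plain Python (not the course code): ReLU as the non-linearity, and gradient-descent steps on f(x) = x² showing that a small learning rate converges toward the minimum while a too-large one overshoots and diverges.

```python
# ReLU non-linearity: zero for negative inputs, identity otherwise.
def relu(x):
    return max(0.0, x)

# n gradient-descent steps on f(x) = x^2 (derivative 2x) with learning rate lr.
def sgd_steps(x, lr, n):
    for _ in range(n):
        x = x - lr * 2 * x  # step opposite the gradient
    return x

small = sgd_steps(1.0, lr=0.1, n=20)  # shrinks toward the minimum at 0
large = sgd_steps(1.0, lr=1.1, n=20)  # overshoots: |x| grows at every step
print(abs(small) < 0.1, abs(large) > 1.0)  # True True
```

Each step multiplies x by (1 - 2·lr), so any lr above 1 makes the factor larger than 1 in magnitude and the iterates blow up.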

What “sees” a CNN & Learning rate finder

  • Link: https://www.youtube.com/embed/IPBSB1HLNLo?hl=en_US&cc_lang_pref=en_US&cc_load_policy=1&autoplay=1&start=4103&end=4899
  • Duration: 13 min 16s (01:08:23 to 01:21:39)
  • Topics:
    – A paper on “Visualizing and Understanding Convolutional Networks”: on the left, 9 kernels (filters) are applied to the input layer, producing 9 feature maps in the first hidden layer (layer 1). Each filter searches for a specific basic shape (an edge), and on the right are the 9 top activations for each filter (within the image set used).
    – Key point: the filters (the 9 numbers of each matrix) are learned, not programmed.
    – Layer 2 is the result of applying a new series of filters (16) to the 9 feature maps of layer 1. Again, we show the 9 top activations for each filter. For example, we can now see activations for photos of sunsets, horizontal lines or corners: layer 2 detects more complex things than layer 1.
    – Layer 3: its filters recognize text or human faces (only 3 layers to perform that detection!).
    – Layer 5: its filters recognize animals…
    – Implementation in ‘lesson1.ipynb’
    – ‘Cyclical Learning Rates for Training Neural Networks’, implemented in the fastai library as “lr_find” (the learning rate finder): it lets you find, semi-automatically, the right learning rate to train your neural network.
    – Why training sometimes starts but stops before 100%: use the Learner Schedule Finder.
    – How many epochs to run? (an epoch is one pass through the entire dataset, batch by batch). As many as you want, until the accuracy gets worse.
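
The idea behind the learning rate finder can be sketched in toy Python (the function `lr_finder` and its parameters are made up for illustration; this is not the fastai implementation): train while increasing the lr exponentially, record the loss at each step, and pick an lr where the loss is still decreasing steeply, before the curve blows up.

```python
# Toy learning-rate-finder sketch on the loss f(x) = x^2.
def lr_finder(grad, x0=1.0, lr_start=1e-4, lr_end=2.0, steps=50):
    x = x0
    lrs, losses = [], []
    for i in range(steps):
        # exponentially increasing learning rate from lr_start to lr_end
        lr = lr_start * (lr_end / lr_start) ** (i / (steps - 1))
        lrs.append(lr)
        losses.append(x * x)   # record the loss before the step
        x = x - lr * grad(x)   # one SGD step at the current lr
    return lrs, losses

lrs, losses = lr_finder(lambda x: 2 * x)  # gradient of f(x) = x^2
# The recorded losses first decrease, then shoot up once the lr gets too large.
```

Plotting `losses` against `lrs` (log scale) gives the characteristic lr_find curve; fastai suggests choosing an lr about an order of magnitude below the minimum of that curve.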

Brasília part 1 group
(Pierre Guillou) #2

Hi everyone,
want to understand how a CNN works?

3 articles to study:


(Pierre Guillou) #3

My post about ResNet and why it works:
Understand how works Resnet… without talking about residual


(Pierre Guillou) #4

Video by Yann LeCun on Convolutional Neural Networks: https://www.youtube.com/watch?v=xgqm6TDhjrQ


(Pierre Guillou) #5


(Pierre Guillou) #6

On Saturday, I talked about the problem of getting predictions on the test dataset when you forget to put the name of its folder in the data object (and thus before training your learn object).

I just found a post giving a solution, which I published last year.