Part 1: complete collection of video timelines

As a companion to the post “Part #2: complete collection of video timelines”, here is its twin for Part #1.
Note: this post is a compilation of the video timelines created by interns & students of Part #1 in the Wiki; I did some light editing to keep the flow consistent between lessons.

The full Part #2 video syllabus is available here:


Lesson 1 video timeline

00:00:00 - Fast AI & the course

00:05:29 - Why Deep Learning is exciting

00:10:51 - Deep Learning setup

00:16:02 - Deep Learning trends and applications

00:20:06 - Starting your AWS instance

00:27:07 - Introduction to Jupyter Notebooks

00:33:43 - Introduction to Kaggle

00:41:14 - Introduction to tmux

00:52:57 - Kaggle Dogs vs. Cats data & general data structuring tips

01:01:01 - Introduction to Markdown

01:02:02 - Introduction to some scientific Python libraries

01:09:23 - Pre-trained models & ImageNet

01:15:15 - VGG model

01:17:08 - Implementing VGG

01:22:14 - Python stack being used

01:23:48 - Theano vs. TensorFlow

01:27:02 - Keras and Theano settings

01:30:20 - Batches

01:34:38 - Finetuning ImageNet VGG16 for Dogs vs. Cats


Lesson 2 video timeline

00:00:09 - Teaching Approach

00:05:22 - How to Ask For Help (Tips)

00:07:10 - How to Ask For Help (Example)

00:08:30 - Class Resources: Wiki

00:09:55 - Class Resources: Forum

00:10:25 - Class Resources: Slack

00:11:20 - Class Survey

00:17:14 - Solution to Dogs vs Cats Redux Competition

00:17:30 - Downloading the Data

00:20:00 - Planning (Overview of Tasks)

00:20:25 - Preparing the Data (Validation and Training Set)

00:22:15 - Using VGG16 (Finetune and Train)

00:22:48 - Submitting to Kaggle

00:30:30 - Competition Evaluation Metric: Log Loss

00:37:18 - Experiment: Running More Epochs

00:40:37 - Visualizing Results

00:47:37 - Introducing the Kaggle State Farm Competition

00:50:29 - Question: Will ImageNet Finetuning Approach work for CT Scans?

00:53:10 - Lesson 0 Video, Convolutions

00:54:09 - Why do we do finetuning?

00:54:43 - What do CNNs learn?

01:03:30 - Deep Neural Network in Excel

01:07:54 - Initialization

01:14:08 - Linear Model from Scratch

01:15:10 - Loss function

01:15:49 - Update function

01:24:40 - Question: What if you don’t know the derivative of a function?

01:25:37 - Linear Model in Keras

01:29:58 - Linear Model with CNN Features for Dogs vs. Cats Redux

01:44:12 - Introducing Activation Functions

01:46:51 - Universal Approximation Theorem

01:48:20 - Review: VGG16 Finetuning


Lesson 3 video timeline

00:00:10 - How to use the provided notebooks

00:08:48 - Video of CNN visualization

00:13:11 - CNN review

00:26:34 - VGG review

00:30:13 - Max Pooling review

00:32:12 - CNNs Q&A

00:42:32 - Softmax Function

00:49:40 - SGD review

00:53:10 - More CNNs Q&A

00:59:12 - Finetuning Review

01:12:52 - Underfitting and Overfitting

01:28:42 - Approaches to reducing overfitting

01:31:17 - Data Augmentation

01:39:55 - Batch Normalization

01:48:50 - End-to-End Model Building Process for MNIST

01:57:17 - Ensembling


Lesson 4 video timeline

00:00:00 - CNN review (excel)

00:11:28 - SGD (excel)

00:11:43 - CNN/SGD Q&A

00:26:31 - Visualizing SGD in 2D and 3D

00:28:53 - Visualizing and explaining Momentum in 3D

00:32:20 - Momentum

00:34:35 - Dynamic Learning Rates and Adagrad

00:41:15 - RMSprop

00:46:14 - Adam

00:49:00 - Eve

00:53:52 - Jeremy’s approach to automatic learning rate annealing

00:56:57 - Jeremy’s solution to Kaggle’s “State Farm Distracted Driver Detection”

01:22:05 - Knowledge Distillation (Geoffrey Hinton, Jeff Dean: “Distilling the Knowledge in a Neural Network”)

01:22:50 - Introduction to Semi-Supervised Learning

01:23:45 - Pseudo-Labeling

01:25:35 - Jeremy’s Kaggle solution Q&A

01:36:01 - Collaborative Filtering

01:51:45 - Collaborative Filtering Q&A

01:58:26 - Collaborative Filtering (continued)


Lesson 5 video timeline

00:00:01 - Tips to get 98.94% accuracy on Dogs vs. Cats Redux

00:01:55 - Introducing Batch Normalization into a Pre-Trained Model
& Batch Norm Review + using Batch Norm with VGG

00:10:00 - Collaborative Filtering & Bias Model

00:13:45 - Adding regularization to loss function

00:15:40 - Analyzing Parameters
& Bias + Latent Factors + PCA

00:23:40 - Keras Functional API
& An Aside on Embeddings Functions

00:34:00 - Natural Language Processing
& Sentiment Analysis

00:44:30 - Single hidden layer model

00:56:00 - CNN model & Aside on 1-Dimensional Convolutions

01:12:00 - Unsupervised Learning for Word Embeddings
& Visualizing Word Embeddings

01:31:00 - Using GloVe for sentiment analysis

01:36:00 - Multi-Size CNNs

01:43:06 - Recurrent Neural Network (RNN) & the Need for RNNs

  • Thinking about Neural Networks as Computational Graphs

01:59:00 - RNN example code for words prediction


Lesson 6 video timeline

00:00:01 - Pseudo-labeling

00:01:15 - MixIterator introduction

00:06:57 - Review: Embeddings

00:08:10 - Embeddings example: MovieLens Data Set

00:13:30 - Word embeddings example: Green Eggs and Ham

00:15:33 - RNNs

00:20:00 - Visual vocabulary for representing neural nets

00:22:56 - 3 kinds of layer operations

00:25:30 - Building first char-RNN in Keras

00:27:28 - Predict 4th character from previous 3

00:38:45 - Generalize first char-RNN formulation: Predict char n from chars 1 to n-1

00:42:20 - RNN from standard Keras dense layers

00:48:25 - Initialization for hidden to hidden dense layer (identity matrix)

00:51:36 - Alternative char-RNN formulation: Predict chars 2 to n using chars 1 to n-1 (sequence to sequence)

01:02:08 - Stateful model with Keras (long-term dependencies)

01:04:30 - Exploding gradients/activations

01:05:55 - LSTM introduction

01:12:07 - Use of TimeDistributed

01:16:50 - Experiments with stacked LSTM

01:23:01 - Build RNN in Theano

01:25:46 - Aside: “loss=sparse_categorical_crossentropy” as an alternative to one-hot encoding the output

01:27:30 - Aside: One-hot sequence model with Keras

01:28:50 - Theano overview

01:29:50 - Theano concepts: Variable

01:35:50 - “theano.scan” operation (RNN steps)

01:39:47 - Scan calls step function

01:43:20 - Theano error/loss

01:43:48 - “theano.grad” calculate derivatives

01:44:43 - “theano.function”

01:49:06 - Lesson goals, plans

01:50:15 - In-class questions

01:56:59 - Tip: Exploring layer definitions in Keras

02:01:05 - Tip: shift-tab

02:01:40 - Tip: Python debugger in Jupyter notebook


Lesson 7 video timeline

TBD

  • CNN architectures: ResNet, Inception, fully convolutional nets, multi-input and multi-output nets;
  • Localization with bounding box models and heatmaps;
  • Using larger inputs to CNNs;
  • Building a simple RNN in pure Python;
  • Gated recurrent units (GRUs), and how to build a GRU RNN in Theano

I’m looking for the exact lesson and time where Jeremy presents the “Distilling the Knowledge in a Neural Network” paper by Hinton and Dean; I missed it.
https://arxiv.org/abs/1503.02531

If you have the info, please post it here so I can update the list.

I found the lesson and time for Knowledge Distillation in Lesson 4, and updated the post accordingly.

01:22:05 - Knowledge Distillation (Geoffrey Hinton, Jeff Dean: “Distilling the Knowledge in a Neural Network”)

This is really great, thanks. Have you considered adding it to the wiki?