Wiki: Lesson 1

try: !rm -r {PATH}train/.ipynb_checkpoints

also do this for valid.

Hi guys, how do I resolve this issue? The directory exists; it's just that there is supposed to be a slash between Fastaipics and valid, i.e. something like Fastaipics/valid… Thanks!

I get this when I set replace=False. What does “cannot take a larger sample than the population” mean, and how do I work around it?

Thanks!
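For what it's worth, that message is NumPy's sampling error: with replace=False each item can be drawn at most once, so you cannot ask for more samples than you have items. A minimal sketch (assuming the error comes from np.random.choice; the file names are made up):

```python
import numpy as np

files = np.array(["img1.jpg", "img2.jpg", "img3.jpg"])

# With replace=False, each item can be drawn at most once,
# so the sample size cannot exceed the population size.
try:
    np.random.choice(files, size=5, replace=False)
except ValueError as e:
    print(e)  # complains that the sample is larger than the population

# Workarounds: request fewer items, or allow repeats with replace=True
sample = np.random.choice(files, size=5, replace=True)
print(len(sample))  # 5
```

So either shrink the sample size to at most the number of available files, or set replace=True if repeats are acceptable.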

Hello everyone,

I’m trying to reproduce the first notebook on a sample of the original dogscats dataset (around 200 pictures) by following the instructions given at the end of the notebook (section called “Review: easy steps to train a world-class image classifier”), but I’m a bit confused.
I have difficulty understanding this two-stage procedure, corresponding to the points

  3. Train last layer from precomputed activations for 1-2 epochs
  4. Train last layer with data augmentation (i.e. precompute=False) for 2-3 epochs with cycle_len=1

which I implemented with

learn.fit(0.05, 2)                 # step 3: last layer, precomputed activations
learn.precompute = False
learn.fit(0.05, 3, cycle_len=1)    # step 4: last layer, with data augmentation

There are two things I don’t understand:

  1. Why is data augmentation related to precompute=False? I had the impression that these two things were independent. I thought that precompute means we already have the activations fixed for the early layers, and that data augmentation just means we artificially produce more data by adding modified versions (rotations, cropping, etc.) of the original pictures. In what way are the two related?
  2. Why do we do both 3 AND 4? Is it a way to initialize the weights to some good values (in 3) and then improve them (in 4), rather than starting with random weights?

Sorry for my naivety, I’m really a beginner.
Thanks in advance!

Another quick question about how to use lr_find():

I don’t really understand the purpose of the variable lrf in the cell with

lrf=learn.lr_find()

it looks like lrf doesn’t reappear anywhere else in the code. I had the impression that the aim of this lr_find method was to be then able to do the plot with the command

learn.sched.plot()

and then to choose a good learning rate by looking at the plot.

Did I get something wrong?

Thanks in advance.

  • Precompute and tfms (transformations) have a well-discussed thread on this forum; here it is: (precompute=True). It answers 3 and 4.

  • lrf=learn.lr_find(): you can skip assigning lrf, but it becomes quite useful in the later stages.

We can use lr_find like this, e.g. with differential learning rates:

  • lrs = np.array([ 1, 2, 3 ]); learn.lr_find(lrs / 1000)

It is based on the paper Cyclical Learning Rates for Training Neural Networks.
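The idea behind the finder is simple even without the library: sweep the learning rate exponentially from a tiny value to a large one, record the loss at each step, and pick a rate where the loss is still falling steeply. A toy sketch of just the sweep schedule (function name and values are hypothetical, not fastai API):

```python
def lr_schedule(lr_start, lr_end, n):
    """Exponential sweep from lr_start to lr_end over n steps,
    as an LR finder would apply one step per mini-batch."""
    ratio = (lr_end / lr_start) ** (1 / (n - 1))
    return [lr_start * ratio ** i for i in range(n)]

lrs = lr_schedule(1e-5, 10, 100)
print(lrs[0], lrs[-1])  # sweeps from 1e-5 up to 10
```

Plotting loss against this schedule (on a log x-axis) is exactly what learn.sched.plot() shows after lr_find.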


@nickl
Aha!
It looks like someone submitted a pull request to fix lesson 2, but the Lesson 1 notebook still needs the fix.

I’ll fix the Lesson 1 notebook and submit a pull request.

The repo is now updated to work with both Python 2.x and 3.x!

Is there a script to set up a conda environment on my own Ubuntu machine with all the libraries needed for fast.ai? Everything mentioned here so far concerns cloud installation. Nevertheless, I seem to have the latest mobile GPU and want to give it a go.

Perhaps this is what you're looking for: bash setup script


I understand the overall concept of the notebook now. Going through it line by line, the following line stood out.

Why is it that we use only the probability of dog throughout the notebook? Why don't we use the probability of cat anywhere? I presume we could have used either one, since the probabilities along each row sum to 1 (or almost 1).
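Your presumption is right: with two classes the rows of the probability matrix sum to 1, so one column fully determines the other. A small sketch with made-up log-probabilities (the values are hypothetical, not from the notebook):

```python
import numpy as np

# Toy log-probabilities for 3 images over 2 classes [cat, dog],
# shaped like a classifier's log-softmax output.
log_preds = np.log(np.array([[0.9, 0.1],
                             [0.2, 0.8],
                             [0.5, 0.5]]))
probs = np.exp(log_preds)

# Each row sums to 1, so P(dog) carries all the information:
print(probs[:, 1])      # dog probabilities
print(1 - probs[:, 0])  # identical: 1 minus the cat probabilities
```

Using the dog column is just a convention; the cat column would work equally well.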

I think they are moved to the fastai repo.


However, I don’t see the folded table of contents in the notebook from lesson 1. Am I missing something?

Hi,
I have just started the fastai Part 1 v2 course and have finished watching the first video. How are the files divided into train/valid etc.? How can I learn more about these terms and how to divide files accordingly?

Interested to find out more about this too!

@alessa / @lukebyrne do you guys have any insights? I saw a post [Wiki: Lesson 1] which talked briefly about this, but I’m not sure if this was ever resolved.

Reason I’m asking is that the ImageClassifierData.from_paths method takes the following args:

            trn_name: a name of the folder that contains training images.
            val_name:  a name of the folder that contains validation images.
            test_name:  a name of the folder that contains test images.

Any insights into the train/valid/test split required to feed into this method will be really helpful.
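From the argument names above, from_paths expects class subfolders under each of those directories, e.g. PATH/train/dogs and PATH/valid/dogs. A runnable toy sketch of building that layout and carving a validation set out of train by moving (not copying) files — the paths, class names, and 20% split are my own assumptions for illustration:

```python
import os
import random
import shutil
import tempfile

# Build a toy PATH/train/{cats,dogs} layout with 10 dummy files per class.
PATH = tempfile.mkdtemp()
for cls in ("cats", "dogs"):
    os.makedirs(os.path.join(PATH, "train", cls))
    os.makedirs(os.path.join(PATH, "valid", cls))
    for i in range(10):
        open(os.path.join(PATH, "train", cls, f"{cls}.{i}.jpg"), "w").close()

# Move (don't copy!) a random 20% of each class into valid/.
random.seed(0)
for cls in ("cats", "dogs"):
    trn_dir = os.path.join(PATH, "train", cls)
    files = sorted(os.listdir(trn_dir))
    for f in random.sample(files, k=len(files) // 5):
        shutil.move(os.path.join(trn_dir, f),
                    os.path.join(PATH, "valid", cls, f))

print(len(os.listdir(os.path.join(PATH, "train", "cats"))))  # 8
print(len(os.listdir(os.path.join(PATH, "valid", "cats"))))  # 2
```

With a layout like this, something along the lines of ImageClassifierData.from_paths(PATH, trn_name='train', val_name='valid') should pick up the classes from the folder names.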

Thanks!

You can find more details on Stack Overflow.

Usually the dogs and cats examples have only train and valid datasets, where the training dataset is 12500 files per class and the validation dataset is 1000 files per class (~7% of the data).

What you need to pay attention to when you build your datasets is to cut sample files from the training dataset and move them into the validation dataset (instead of just copy-pasting).

If you enter a Kaggle competition, for example, you will also have a test dataset (with no labels/classes). In that case, you can train your model using cross-validation, and at the end you can put all of your files in the training dataset (no more validation set) to produce your final weights.


Here is a short video by Andrew Ng (Coursera) on how to split the data.

[60% training, 20% validation, 20% test]
You can change these proportions and check how they affect your final model's performance.
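The 60/20/20 split above is easy to sketch in plain Python — shuffle the item indices once, then slice. The function name and seed here are my own choices for illustration:

```python
import random

def split_indices(n, train=0.6, valid=0.2, seed=42):
    """Shuffle n item indices and split them into train/valid/test,
    with the test set taking whatever remains after train and valid."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)  # fixed seed -> reproducible split
    n_train = int(n * train)
    n_valid = int(n * valid)
    return idx[:n_train], idx[n_train:n_train + n_valid], idx[n_train + n_valid:]

trn, val, tst = split_indices(100)
print(len(trn), len(val), len(tst))  # 60 20 20
```

Shuffling before slicing matters: if the files are sorted by class, a plain slice would put whole classes into one split.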

Thanks @alessa! This was helpful for a general train/valid/test split. I was wondering more specifically:
do you know of any specific setup requirements for the train/valid/test split for fastai's method?

Thanks again.

When choosing a learning rate with the LR finder, you can plot a vertical line to ensure you choose the correct x-coordinate of the point you’re interested in. Otherwise it can be difficult to interpret values on the x-axis, since they’re in log scale.


import matplotlib.pyplot as plt

learn.sched.plot()                   # loss vs. learning rate (log-scale x-axis)
plt.axvline(x=1.6e-2, color="red");  # mark the chosen learning rate

Also, adding %matplotlib notebook seems awesome (you can keep editing the same plot until you create a new one).


The Kaggle Dogs vs. Cats Redux competition asks us to report the probability that an image is a dog, hence the interest in calculating the dog probability.
