Lesson 1 In-Class Discussion ✅

gosftw · May 28, 2019, 2:03pm

Hi everyone, I’m new here. I have some background in DL, but all of it in TF and Keras. I watched the first lesson and I’m really interested in completing the whole course. Which material do you recommend reading to get up to date with the fastai library and PyTorch? Will the docs alone be enough?

On another note, is there any chance of attending part 2 of the course online?

Kind regards,

Manuel

foobar8675 · May 28, 2019, 5:07pm

thank you.

abyaadrafid · May 29, 2019, 5:17am

Hi,
I am reading train and test image lists from pandas DataFrames and loading them into a DataBunch.
However, I want to get the dataset objects for the train and test sets. How can I do that?

Code:

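from fastai.vision import *  # fastai v1: provides ImageList, get_transforms, imagenet_stats

# `train` and `test` are pandas DataFrames; `path` is the dataset root directory.
test_imagelist = ImageList.from_df(test, path=path, folder='test_images', cols='id', suffix='.jpg')
train_imagelist = ImageList.from_df(train, path=path, folder='train_images', cols='id', suffix='.jpg')

# Hold out 20% for validation, label from the DataFrame column, attach the test set.
src = (train_imagelist.split_by_rand_pct(valid_pct=0.2, seed=42)
       .label_from_df('category_id')
       .add_test(test_imagelist))

# Apply the default transforms at size 224, build the DataBunch, normalize with ImageNet stats.
data = (src.transform(get_transforms(), size=224)
        .databunch()
        .normalize(imagenet_stats))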

I tried calling src.datasets() at every step, but I always get errors.
What am I doing wrong?

naive · May 31, 2019, 5:19am

Yes, it is working.

naive · May 31, 2019, 5:23am

You chose “1e-6 to 1e-4”, a range in which the losses stay between 4.00 and 4.25 and vary very little; that is, the loss seems to be unaffected by the learning rate in that range.
When you use “1e-1”, the losses rise exponentially from there.
Try using the optimum value, “1e-2”.
Hope it helps.
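
To make that reasoning concrete, here is a minimal sketch in plain Python (it is not fastai's own code). The learning rates and losses are made-up numbers shaped like the curve described above, and `steepest_descent_lr` is a hypothetical helper. It applies one common heuristic: pick the learning rate at the right-hand end of the segment where the loss falls fastest per step in log10(lr).

```python
import math

# Hypothetical (learning rate, loss) pairs shaped like the curve described
# above: nearly flat from 1e-6 to 1e-4, falling towards 1e-2, blowing up at 1e-1.
lrs = [1e-6, 1e-5, 1e-4, 1e-3, 1e-2, 1e-1]
losses = [4.25, 4.20, 4.10, 3.40, 2.10, 9.00]


def steepest_descent_lr(lrs, losses):
    """Return the learning rate at the end of the steepest falling segment."""
    best_lr, best_slope = None, 0.0
    for i in range(1, len(lrs)):
        # Slope of the loss with respect to log10(lr) between neighbouring points.
        step = math.log10(lrs[i]) - math.log10(lrs[i - 1])
        slope = (losses[i] - losses[i - 1]) / step
        if slope < best_slope:
            best_lr, best_slope = lrs[i], slope
    return best_lr


chosen_lr = steepest_descent_lr(lrs, losses)
print(chosen_lr)
```

With these assumed numbers the flat region (1e-6 to 1e-4) is skipped because its slope is tiny, and 1e-1 is skipped because the loss is rising there, so the helper lands on 1e-2.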

naive · June 1, 2019, 10:59am

Hi,
The slides are not available directly in .ppt format, but you can find screenshots of the slides at the following link:

https://github.com/hiromis/notes/blob/master/Lesson1.md

# Lesson 1
[Webpage](http://course-v3.fast.ai/) / [Video](https://youtu.be/BWWm4AzsdLk) /  [Lesson Forum](https://forums.fast.ai/t/lesson-1-official-resources-and-updates/27936) / [General Forum](https://forums.fast.ai/t/faq-resources-and-official-course-updates/27934/1)



## Welcome! 

Make sure your GPU environment is set up and you can run Jupyter Notebook

[00_notebook_tutorial.ipynb](https://github.com/fastai/course-v3/blob/master/nbs/dl1/00_notebook_tutorial.ipynb)


Four shortcuts:

- <kbd>Shift</kbd>+<kbd>Enter</kbd>: Runs the code or markdown on a cell

- <kbd>Up Arrow</kbd>+<kbd>Down Arrow</kbd>: Toggle across cells

- <kbd>b</kbd>: Create new cell


patrolsecurity · June 1, 2019, 7:16pm

Hi, when I log in to the machine for the fast.ai template, I should see a data directory among the other directories (anaconda3, data, downloads, fastai), but I do not see it and I don’t know why. I’m looking for your help. Thank you.

happylearning · June 3, 2019, 4:57pm

Hi all, new to DL here. I got some decent results from comparing 2 cat breeds, but unfreezing made everything worse. The lecture said to try and look at the plot and pick a low loss point so we can increase accuracy. My chart looks a bit different from the other ones I’ve seen after running learn.recorder.plot(). Can someone help me figure out how to get better results after unfreezing? Also, the lines aren’t the clearest, so is it okay to just eyeball where that dip is on my chart?

lexicon · June 3, 2019, 6:52pm

Hello,

I’ve just finished week #1.
One question: can we use a single label for classification, in a boolean sense? That is, in the category or not in the category?

What I’m trying to do as practice is to train a model with images of a single city and when given an image as an input, it should tell if the image is that city’s or not.

Thanks.

Mark_F · June 3, 2019, 8:37pm

Hi, everyone.

I am using Salamander

I have only rudimentary computer science knowledge, so I apologize if this is a simple question, but I am stuck on untar_data(). Specifically, the first argument is (url:str,…). If I pass an external URL to untar_data, I get an error. It seems that the example used in the course (“URLs.PETS”) is part of the class “URLs”, and when I call help, it provides a list of available datasets. Why can I not pass an external URL to untar_data? How do I modify an external URL (e.g. CIFAR-10 [https://www.cs.toronto.edu/~kriz/cifar.html]) so that I can use it in the Jupyter notebook? I am getting an error saying that it is not a tar zip file (sorry, I forget the exact error).

Also, the path for the pets dataset is ‘/home/ubuntu/.fastai/data/oxford-iiit-pet’. I have found this path in the terminal but cannot find it in the Salamander GUI directory. I am seeing “cifar-100.tgz.tgz” and “cifar-10-python.tar.gz.tgz” in this directory, so they must have been uploaded at some point when I entered the URL.

Finally, in the ImageDataBunch.from_name_re() method, why do we need to specify both path and fnames? Both seem to be path objects, and the fnames argument is just longer, with the file names appended. Does ImageDataBunch subtract path from fnames to determine the file name and apply the regular expression?

Any help appreciated. Thanks.

Anders · June 4, 2019, 12:55am

Sure! Let’s say your city is New York City. You’d have 2 labels: nyc and not_nyc. As far as the model is concerned, that’s no different than the labels dog and cat. In fact, you could find New Yorkers who’d say the difference between nyc and not_nyc is bigger than between cats and dogs.
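
As a rough sketch of the two-label idea in plain Python: the folder layout, file names, and the `binary_label` helper below are all hypothetical, and this only shows how any city folder can be collapsed into nyc or not_nyc before training.

```python
from pathlib import PurePosixPath

# Hypothetical file layout: images sorted into one folder per city.
image_paths = [
    "images/nyc/skyline_001.jpg",
    "images/paris/tower_004.jpg",
    "images/nyc/bridge_017.jpg",
    "images/tokyo/crossing_002.jpg",
]


def binary_label(path, target="nyc"):
    """Map an image path to one of two labels: the target city or not."""
    city = PurePosixPath(path).parent.name
    return target if city == target else f"not_{target}"


labels = [binary_label(p) for p in image_paths]
print(labels)
```

Every image from any other city ends up under the single not_nyc label, so the model still sees two ordinary classes.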

Another ex: hotdog and not_hotdog, which was featured on an app on the TV show Silicon Valley.

Here’s an example of someone recreating it using fastai:

lexicon · June 4, 2019, 10:39am

Thank you very much Anders, never thought of it this way, training non_city with images of different cities. I was more thinking in terms of not training non_city at all.

Thanks again, much appreciated.

mgloria · June 4, 2019, 1:51pm

Beginner question: When I use the ImageDataBunch from_folder method, classes are numbered alphabetically (i.e. class1 is 0 in the predictions, class2 is 1, etc.). How could I change this order? In my case (anomaly detection), I have 2 classes (‘good’ and ‘bad’) and I would like ‘bad’ to get index 1 and ‘good’ index 0, as is usually the convention. Thanks a lot!
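
The numbering described above can be sketched in plain Python, without fastai. The folder names are taken from the question; the two dictionaries are only an illustration of how an alphabetical ordering differs from an explicitly given one. As far as I know, fastai v1's `ImageDataBunch.from_folder` accepts a `classes` list that plays the role of `explicit_classes` here, but treat that as an assumption and check the docs.

```python
# Folder names from the question, as they might appear on disk.
folder_names = ["good", "bad"]

# Alphabetical ordering: 'bad' comes first, so it gets index 0.
alphabetical = {name: idx for idx, name in enumerate(sorted(folder_names))}

# Explicit ordering: the position in the list decides the index.
explicit_classes = ["good", "bad"]
explicit = {name: idx for idx, name in enumerate(explicit_classes)}

print(alphabetical)
print(explicit)
```

The second mapping gives ‘good’ index 0 and ‘bad’ index 1, which is the order asked for.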

Anders · June 4, 2019, 2:08pm

You’re welcome! I’m still very new at this, but I think one of the major pieces of doing deep learning, or any machine learning, is figuring out: how do I turn my problem into something computers are good at doing?

It’s like cooking on an outdoor grill. Grills are great at cooking food that’s a certain size, like a burger. They’re not great at cooking small pieces of meat or vegetables — they’ll fall through the grill. That’s why people use skewers/kabobs: they use the skewer to “transform” several too-small pieces into one big enough piece, and now you can grill it.

Similarly, if a deep learning method needs more than 1 value, you change the way you’re defining your problem to give it 2 values.

tedmasterweb · June 4, 2019, 6:01pm

Hi,

I’m totally new at this so please forgive me if I’m asking obvious questions. I’ve gone through the first lesson a few times now and although I can shift+enter to run the code, I don’t feel like I know what I’m doing. I feel like I need an introductory course, maybe with some history of how deep learning works. That aside, I do have a specific question.

Regarding most_confused(), if the system is able to tell me which ones it got wrong, why didn’t it just adjust and get them right? And how does it know which ones it got wrong?

Really sorry if this is obvious but I don’t see this explained anywhere in the first lesson.

Thanks in advance.

Kind regards,

Ted Stresen-Reuter

abyaadrafid · June 4, 2019, 6:32pm

Hi Ted, I’m a beginner as well but I’ll try to explain things the best I can.

most_confused() lists the (actual, predicted) category pairs the model got wrong most often on the validation set. The model cannot adjust and get them right, because it does not “read” (train) from the validation set. The validation set is kept aside to determine how well our model is doing.

Regarding how it knows which ones are wrong: Items in the validation set actually have their label. Our model first predicts what the item label is (with a certain confidence value) then looks at the actual label and determines whether it predicted right or wrong.

To sum it up, we have two sets of data:

- Training set
- Validation set

The model first looks at the training set items along with their labels. Then it predicts the labels for the items in the validation set. Finally, it checks whether it predicted correctly and how wrong (or right) each prediction was.
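
Here is a minimal sketch of that checking step in plain Python. It is not fastai's implementation: the labels and predictions are made up, and `most_confused_pairs` is a hypothetical helper that only mimics the idea of comparing predictions against known validation labels and counting the mistakes.

```python
from collections import Counter

# Hypothetical validation set: the true labels are known in advance, and the
# predictions come from a model that never trained on these items.
actual = ["cat", "dog", "dog", "cat", "bird", "dog", "cat"]
predicted = ["cat", "cat", "dog", "dog", "bird", "cat", "cat"]


def most_confused_pairs(actual, predicted):
    """Count (actual, predicted) pairs where the prediction was wrong."""
    mistakes = Counter(
        (a, p) for a, p in zip(actual, predicted) if a != p
    )
    # Most frequent mistakes first, as (actual, predicted, count) tuples.
    return [(a, p, n) for (a, p), n in mistakes.most_common()]


confused = most_confused_pairs(actual, predicted)
accuracy = sum(a == p for a, p in zip(actual, predicted)) / len(actual)
print(confused)
print(accuracy)
```

Nothing here changes the model: the validation labels are used only for scoring, which is why the mistakes can be listed without being fixed automatically.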

Hope this helped.

tedmasterweb · June 4, 2019, 6:48pm

Hi @abyaadrafid!

That helped a lot. Thank you very much! It now seems obvious. I don’t know why I wasn’t seeing it before!

Thanks again.

abyaadrafid · June 4, 2019, 6:56pm

Happens to the best of us.
Also, welcome to the forums

lexicon · June 4, 2019, 9:16pm

Really great analogy (I might use it some day).

Thanks!

polohot · June 5, 2019, 10:14am

Hello, I am trying to run the code in Spyder because I have been using it since fastai v0.7.
I got stuck at the first call:

learn.fit_one_cycle(4)

It doesn’t show the training epochs in the Spyder console;
instead it shows this:

<IPython.core.display.HTML object>

Is there any workaround to show the training details in the spyder console?

Many Thanks