Wiki thread: lesson 1

jacksonisaac · March 6, 2019, 6:51pm

You can add a folder (e.g., custom) within fastai module directory. Your directory might look something like -

fastai/
– tabular
– utils
– …
– custom

You may add user defined modules under custom folder. git pull won’t overwrite/revert your changes. If you want to have git tracking for custom files as well, you may want to fork the fastai repo and push your custom code to the fork.

In your code, you would import the function as -
from fastai.custom import function_name

You can add the module to existing folder as well, but to keep user code separated from upstream it might be easier to have it under a different folder.

averma · March 21, 2019, 6:13pm

Yeah,sometimes the server is down . It’s very frequent in kaggle

drauni · March 27, 2019, 7:25am

Hi,
as suggested in the video I set up an account with crestle.
However the course data was not in there.
Then I followed the setup instructions.

Now I am getting the following error:
ModuleNotFoundError: No module named ‘cv2’ when importing fastai.imports

What do I need to do?

Thanks!

edwardeasling · March 27, 2019, 7:34am

I had a similar issue. I was able to get it working by installing all of the packages listed in this blog post

drauni · March 28, 2019, 7:14pm

Thank you so much! This blog post should be pinned at the beginning of this page.

francisc · March 30, 2019, 7:51pm

Hi,
I am a bit stuck after setting up fastai on ubuntu (windows subsystem for linux).
I tried to run the code from the jupyter notebook lesson-rf1, but it gets stuck at the second cell. It throws this error:

Any help would be appreciated.

Edit: Found a solution here.

JeroenvV · April 4, 2019, 12:36pm

I have deployed the DSVM image (Linux version) in Azure and I’m able to connect to the machine via a SSH session. However when I browse to the machine remotely and succesfull login I receive an error message:
500 : Internal Server Error.

Your help is much appriciated.

KR, Jeroen

rgarcia · April 16, 2019, 5:51pm

On the 3rd video / lesson he explains that you will use the dictionary values stored on nas to “process” the Test Set in the same way you did to the Training Set, in order to produce the predictions that you want to submit.

train_set= f ( train_csv )
the nas is somehow similar (not equal) to that f(x)
so then you do
val_set= f ( val_csv )

arajendran · April 30, 2019, 1:06pm

To add on to what @jacksonisaac mentioned, we have to fill the missing value with something. And by filling it with the median we preserve the overall median of the data, with likely a minimal impact on mean and standard deviation. Filling with other values will have a different impact on the distribution of data.

pgxplorer · May 10, 2019, 1:53pm

proc_df() is converting DataFrame input into List

I am trying to predict output for the test data.

type(test_df) # o/p: pandas.core.frame.DataFrame
# test_df is a DataFrame numeric values along with missing values
test_df = proc_df(test_df)
# test_df is now a list
type(test_df) # o/p: list

test_df is getting converted to list. I have no idea why.
Any help? Thanks

Sfundo · May 13, 2019, 6:15pm

hello everyone can some please help, i am currently doing the intro to machine learning course, part 2 specifically, and i decided to take a leap of faith and analysed a random data set, and i got these scores, but because i’ve never seen them it’s really hard to say what they mean, i am use to less scores, eg 0.025

but this:
a) rmse of training set, b) rmse of validation, c) score of train, d) score of valid
[2.6473913845249957, 5.991807398439978, 0.9940291743929885, 0.8914779738266136]

i’ve tried everything, none is working,

if you want i can upload everything to github, for better analysis.

Thanks in advance

taiharry108 · May 15, 2019, 4:55pm

In the course video, @jeremy said something about Machine Learning Driven EDA. I’ve finished lesson 4 and I still don’t really get that what means. Will he talk about it later in the course?

taiharry108 · May 15, 2019, 4:56pm

It’d definitely help if you could share your data and code for the result!

taiharry108 · May 15, 2019, 5:02pm

the first item of the list is the dataframe with independent variables and the second is the dependent variable if you passed the column name of it.

christineseven · July 8, 2019, 3:55pm

Hello! I try to find ml1 but there is not. I use google cloud and I follow this path tutorials/fastai/course-v3//nbs/dl1. There is no option ml1! Nore somewhere in tutorials!
I updated the course repo. Did anybody have the same problem or has an idea how could I fix it?
Thanks!

Yarduza · August 18, 2019, 6:22pm

I’m following the lesson with FloydHub, which the best option I found until now, but I’ve noticed that the GPU utilization is stuck on 0 while running the random forest.

I checked if the CUDA driver is being being recognized and it is. I tried to play with it but with no success, anyone has an idea why might the fastai library not utilize the GPU? (CPU is at 100%)

craquiest · August 20, 2019, 2:06pm

Hi,
I hope you found what you were looking for by now (6 weeks later), and you dont need this answer.
But just in case you do: clone the fastai/fastai repo, you can find ml1 in the courses folder.
Here is a link:
ML1 folder

Hope this helps if needed, but I hope even more than you had already solved your pb.
Cheers,
Lamine

minh · August 23, 2019, 11:39am

Hello every one. I see that we don’t have a clear tutorial on how to connect to AWS and open a Jupyter notebook from Windows, so I want to share how I did it here. Maybe this topic is too basic for a lot of people but for a complete beginner like me it took me several hours to figure it out, especially when I didn’t even know what an SSH tunnel is. So I think it will help a lot of beginners to quickly get through this step and get to experimenting with the lecture.

sturkian · August 23, 2019, 7:04pm

Can I do this course on my Mac? I tried installing fastai but there seems to be alot of fixes I need to do beforehand.

jcatanza · August 23, 2019, 11:59pm

Hi @sturkian The easiest option in my opinion is to use Google Colab. Here’s a Jupyter notebook to guide you through the steps needed to properly run the course notebooks in Google Colab: Colab setup for Part 2 2019