Another treat! Early access to Intro To Machine Learning videos


(Jeremy Howard (Admin)) #402

X is the independent variables, y is the dependent. These are used nearly universally in ML so you’ll be seeing them a lot!

It’s not that important frankly; you’ll learn later in the course that OOB scores are only provided if you ask for them, so we can’t print them out if they’re not there. Googling for hasattr will give details on the function itself - it’s part of standard python.


(Jeremy Demlow) #403

Awesome and thank you for the response, I got a little confused because
"def fit(self, X, y, sample_weight=None):" i didn’t realize that was just the data.frame

I will be googling hasattr() and looking at the source code here shortly.

Jeremy,

I do have two questions for you if you don’t mind.

1.) i am currently taking the DL course that you have created and now that i found this I want to take the ML and DL course. Do you recommend taking them side by side or independently

2.) Also does the fast.ai library need to be run on an instance or can it be used similar to any other package? I would like to be able to use the fast.ai libraries functionality on class projects with out having to be on a server and when i try to import fast.ai i get the following: (I have no issues when i am using it on AWS)

ModuleNotFoundError Traceback (most recent call last)
in ()
1
2 # This file contains all the main external libs we’ll use
----> 3 from fastai.imports import *
4 from fastai.transforms import *
5 from fastai.conv_learner import *

ModuleNotFoundError: No module named ‘fastai’

I have googled this and everything on google is from the forums or related to your course or all mention be apart of an instance and if that is the case it’s okay I will just have to pay the money to run the instance. I am currently in transition to Paperspace as AWS bill was very expensive.

I am so glad that I ended up finding this course from the forums. I am attending LSU and going through their analytic program and I am using your DL course as an independent study, being able to learn practical and cutting edge techniques that can be used right away is priceless. With the added bonus of then going in and understanding how they work is a learning method i am loving.

Again thank you for all your help,

Jeremy


(Ahmad Arib) #404

Try to answer your question based on Jeremy answer that I read so many times, haha.

  1. He suggests taking it side by side, I start from DL, up until lesson 3, keep hear Jeremy suggest taking ML course, switch to ML video until lesson 10, now stuck in ML project academic paper using Random Forest. Will continue ML until finish then back to DL to finish I guess.

  2. what do you mean by “similar to any other package?” and “without having to be on a server”, is that mean you would like to run it on your local machine? PC or laptop?
    I run fastai di cluster server, similar to AWS and paperspace for DL related.
    While just using my laptop for ML related, you just need to install it like this page tell you https://github.com/fastai/fastai
    And I run fastai both in cluster and laptop using Linux Ubuntu.


(Jeremy Demlow) #406

ahmadarib,

I would like to be able to use the fastai library in the anaconda environment, but on my local machine not using a instance on paperspace.

I attempted to use the resource that you gave me, but to no avail. This could be because I wasn’t using the right command prompt. I have a windows 10 & 64 bit computer.

I had to use git bash to get the copy of the repo and used python install setup.py and I thought that did the trick inside the anaconda environment.

At the end of the day i can always use Paperspace to do this, but I’d prefer not to be charged to be able to experiment with ML course.


(Rishaan S Patel) #407

No module found usually refers to the package not being installed on your computer. When you write from fastai.imports import * you are basically telling the computer to go grab fastai library from your computer and load it up because you are about to use. But if the package isn’t installed, then you get that error. Python has multiple ways of installing packages like pip and anaconda, so you might want to look into those. Running pip install bcolz in your terminal should install the package.


(Ahmad Arib) #408

Ahh, my bad, I point you the github link that only could work out of the box if you’re using Linux machine.
And just like a week ago I installing fastai in my girlfriend laptop that also use windows 10 to learn ML videos and practicing it locally, and success, don’t give up yet, lot’s of people could do what you want.
I follow this link How to set up Windows 10 for fast.ai
Get stuck in first option, succeed in second option.
However, thread owner said it’s already obsolete, you need to check out this link also Howto: installation on Windows

Just follow the link above, you need to use Anaconda Prompt to execute everything.
And bcolz is one of them, please follow that 2 link first until sucess, using Anaconda Prompt.
And let us know if you still encounter problem like no module named bcolz, or etc.
Because when you already do right step by step, that error should not be appear at all.


(swetha Godi) #409

That’s an awesome idea :slight_smile: I shall try that out.
Thanks for sharing the information.


(swetha Godi) #410

Thanks for sharing :slight_smile:


(Jeremy Demlow) #411

Howto: installation on Windows “For Context Here Is the Link Used” Make Sure you’re running as an administrator

I made the mistake of not running as an admin the first through and first time I did this the conda env update did work and it didn’t get stuck, but as an admin it took a little longer so be prepared to wait a couple mins and don’t panic

Followed the rest of the sets and was able to get jupyter to run and the clone worked as you can see

The issue is when I run the from Fast.ai.imports isn’t there and bcolz issue anymore though :slight_smile:


(ecdrid) #412

If I am not wrong then your current directory isn’t correct?

Try doing a cd into courses and then either ml or dl and then it should work?


(Jeremy Demlow) #413

I have been trying all types of combination of these using it as an admin and a non-admin to see if that was the case this is what i did and still got the same result with no module named ‘fastai’. Should my path have Anaconda3?

At this point i can get to the juypter notebook the library works, but the bash commands aren’t working.

I believe that this is because jupyternotebook in running power shell and this could be why this isn’t working so i may need to use powershell code in the terminal to get the bulldozer data. I will work on finding out the equivalents to BASH -> Powershell and see if this is the case. I would of course love to use the BASH commands if that is possible, but I don’t know how to go about doing that

AWESOME!! Got it working, but it’s very slow compared to paperspace this was very much worth doing and was successful


(ecdrid) #414

Its Windows, so we have to use !dir {PATH}

In Linux ls works…


(ecdrid) #415

(Sorry for spamming the forum)

Regarding the pdp plots,

How can we have -ve values for our prediction of the sale-prices even though RF’s hasn’t been trained on any?? (as far as i have read about RF’s they can only predict the values alike on what they have been trained)?

Thanks for replying…
@ramesh @jamesrequa @radek (sorry)(can’t remember more)


#416

I don’t think random forests can predict something they have not been trained on :wink:


(ecdrid) #417

So how are we getting those -ve lines in the PDP plots?

(Any justification)


#418

Sorry, I have no clue what it is that you linked to. I remember seeing this graph in the ML lectures but don’t recall what it was used for.

The way random forests work is you present them training examples and a target variable - they will not work otherwise (they use the target variables during training, the splitting of branches).


(ecdrid) #419

I got this diagram on a different dataset


#420

Sorry, I do not recall the discussion of partial dependence plots from the ML course. Something I have not managed to get around to study yet to the extent that I would like


(ecdrid) #421

It’s like you replace let’s say a particular year value in all the rows (make them same for all the rows in the dataset) and then predict what the RF will predict as the sale price,

Doing this for different years will help us to get the Partial dependence…

(Hope I am correct)


(Alex Lee) #422

So I was watching lesson 5 and 6 and I saw Jeremy recommended splitting the data into training and valid sets manually when the data has temporal ordering. TIL there’s a cross-validator in sklearn called TimeSeriesSplit.

Have anyone tried that out?