How to do fastai - Study plans & Learning strategies

Here's how I'm downloading the class audio, with or without the video. It's working well.

Install youtube-dl https://rg3.github.io/youtube-dl/

From the command line (in the folder to download into).

Video and audio:
youtube-dl "url_of_video"

Audio only:
youtube-dl -f 140 "url_of_video"

URLs from Part 1
Lesson:
6: https://www.youtube.com/watch?v=hkBa9pU-H48
7: https://www.youtube.com/watch?v=9spwoDYwW_I

Other download format info is available in [this answer](https://askubuntu.com/a/486298/860226).
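
If you want to grab several lessons in one go, the commands above can be wrapped in a small script. This is only a minimal sketch: it assumes `youtube-dl` is installed and on your PATH, and the names `build_command`, `download_all` and the `dry_run` switch are made up for illustration. It defaults to a dry run that only prints the commands.

```python
import subprocess

# Lesson URLs copied from the list above.
LESSON_URLS = {
    6: "https://www.youtube.com/watch?v=hkBa9pU-H48",
    7: "https://www.youtube.com/watch?v=9spwoDYwW_I",
}


def build_command(url, audio_only=False):
    """Return the youtube-dl argument list for one video (hypothetical helper)."""
    command = ["youtube-dl"]
    if audio_only:
        # "-f 140" is the audio-only format used in the command above.
        command += ["-f", "140"]
    command.append(url)
    return command


def download_all(urls, audio_only=False, dry_run=True):
    """Run youtube-dl once per URL; with dry_run=True only print the commands."""
    commands = [build_command(url, audio_only) for url in urls]
    for command in commands:
        if dry_run:
            print(" ".join(command))
        else:
            # check=True raises an error if youtube-dl exits with a failure code.
            subprocess.run(command, check=True)
    return commands


if __name__ == "__main__":
    download_all(LESSON_URLS.values(), audio_only=True)
```

Pass `dry_run=False` once the printed commands look right; files are saved into the folder you run the script from, same as with the plain commands.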

10 Likes

A lot of great ideas here, not much to add to them.

The one thing I'd like to offer from my experience is that I have never learnt much from running dog classifiers or identifying digits. In fact, the same examples used over and over were at some point pushing me away from all this. Even though I understand their importance in offering a simple and concise example, they won't help you take the next step in the learning process.

My way of learning has been a sort of knowledge-transfer process where I take insights from the awesome work Jeremy and the team are doing, transfer them over to my domain and the specific problems that move me, and develop 'unique' (to my domain at least) solutions to all that.

It has been the only way for me to not only absorb or memorize new knowledge but really make it mine. Hope it works for you.

Kind regards,
Theodore.

This discussion is quite lively... so I am thinking that maybe some of you might be looking for mini projects to get your hands dirty with training...

More information in this post. tldr: I think the datasets by fastai are an underappreciated resource. Started to create a starter pack for imagewoof. Was thinking of keeping the repo private as I work on this but am thinking that maybe someone might find a use for it already, maybe as a refresher before part 2...

Anyhow - whether you use this code or not is immaterial, but do check out the datasets here. They are a super valuable learning resource!

9 Likes

@radek This is definitely a good exercise/mini-project. I had completely missed that something like this can also be undertaken. Will take up one dataset at a time and update my GitHub :slight_smile:

1 Like

I need a project first and then go through the lessons. I think my biggest learning has happened after struggling through Kaggle competitions.

  1. It forces me to work on something
  2. I can compare with others
  3. I really see what appears to work and what doesn't. (I have made several models with a bad validation set that looked like they performed really well but were horrible.)

Working with tabular data, I have probably looked through Rossmann and the Adult Salary set a dozen times, which has gotten me about 80% there. The last 20% is a struggle to get rid of bad habits (not creating a smaller subset while testing architecture, model, feature engineering) and to sort out items I don't understand and ask about on the forums (categorylist vs floatlist, custom metrics, embeddings).
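
On the "smaller subset" habit, here is a rough sketch of the idea in plain Python, not fastai code. The `sample_subset` helper, the 5% fraction and the toy rows are all assumptions for illustration; with a real dataset you would sample from your own DataFrame or item list instead.

```python
import random


def sample_subset(rows, fraction=0.1, seed=42):
    """Return a reproducible random subset of rows (hypothetical helper)."""
    if not rows:
        return []
    # A fixed seed gives the same subset on every run, so experiments stay comparable.
    rng = random.Random(seed)
    size = min(len(rows), max(1, round(len(rows) * fraction)))
    return rng.sample(rows, size)


# Toy stand-in for a tabular dataset.
rows = [{"id": i, "sales": i * 10} for i in range(1000)]

subset = sample_subset(rows, fraction=0.05)
print(len(subset), "of", len(rows), "rows kept for quick experiments")
```

Iterate on architecture and feature engineering with the subset, then rerun on the full data once something looks promising.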

What I hope to give back this round are some more starter packs for Kaggle, similar to the planet one, for others to work and experiment through.

What a time to be alive! :smiley:
Just a few hours from the First lecture of Part 2 and I thought I should share my thoughts again here.

I spent the last few weeks as planned:

I don't know how, but somehow I managed to pull together all of the things that I had planned to do. Maybe it's because I kept my hopes on the lower end, but that's okay. I will make sure not to wear myself out this time. During the last Part 2 runs, I would try to run too hard to keep up and then just fall flat on my face and give up on the lectures. I'm determined not to do the same this time.

  • Managed to complete 3 runs of fastai: 2 thanks to the twimlai meetups and 1 because I had free weekends.
  • I think I've started to get there.
  • So far I've kept up with the Lesson discussions from part 1 and I hope to catch up or keep up with part 2 discussions. Special points if I can keep up in real time, lesser points if I get scared and don't keep up and come back later.
  • I've been doing a few source code deep dives during meetups and presenting papers.
  • I've yet to go back to the mini-ideas and start refactoring, mostly because I'm still adding more ideas around them.

A few goals that I'll settle on for now:

  • Watch the livestreams live, keep up with the discussions, and not just nod along to the nb cells but actually make sure that I understand them reasonably well on the first run. I do plan to come back to the lessons later, so I just hope to have a better idea rather than feel alienated.
  • I've promised to do a paper-a-week summary along with 1 ML hero interview each week. Luckily I teamed up with @lesscomfortable, who has been doing a paper-a-day summary, and it led me to write 8 paper summaries in the past 6 days :smiley: I hope to stick to 1 paper a week after the course starts, even if not at a pace as intense as now.
  • Presenting lesson ideas: I've enjoyed learning via presenting and blogging, and I've started to realise that doing a mini-talk during the DS India meetups (hosted by @aakashns) or during the TWiMLAI meetups has really helped me deepen my understanding. I've somehow managed to do ~30 hours of presentations (my goal for 2019 is 100 hours) and I want to keep doing 1 mini talk or presentation each week on the ideas that I learn in part 2.
  • Capstone Project: I've yet to fall in love with an idea or a paper that I really want to implement. Again, I'm just a CS undergrad student, so that makes me a boring student with neither coding expertise (which is of course my fault) nor a cool background. So I'm still waiting for my shower thought, but I'll keep on working on mini-ideas and really come back to refactoring them in the future.

Another idea I've realised that I'd really want to stick to is ensuring that my setup and DL env are clean. I've done house cleaning today and updated both my machines to the latest fastai and pytorch versions. I set up 2 envs: a bleeding-edge one with everything installed from source, and another for reliability (because I mess up, not because source installation is very hard) where I keep everything conda installed.
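
With two envs around, a quick check of which versions the active one actually has can save some confusion. A minimal sketch using only the standard library (Python 3.8+); `installed_versions` is a made-up helper name, and it assumes the packages are published under the names `fastai` and `torch`.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_versions(packages=("fastai", "torch")):
    """Map each package name to its installed version in the active env."""
    report = {}
    for name in packages:
        try:
            report[name] = version(name)
        except PackageNotFoundError:
            # The package is not installed in this environment.
            report[name] = "not installed"
    return report


if __name__ == "__main__":
    for name, found in installed_versions().items():
        print(f"{name}: {found}")
```

Run it once inside each env to confirm the bleeding-edge one and the conda one are what you think they are.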

Again, we're all learning to learn, so I'll be thankful for any suggestions/corrections to my approach.

Also, for anyone who made it through my boring ideas: in case you'd want to tackle some compute-heavy idea and would need some GeForce, please don't hesitate to reach out to me :slight_smile:

3 Likes

Let's hope that doesn't become the norm, else the ML heroes will not have time for ML... :wink:
More seriously, I like the interviews!

1 Like

I've had success so far with this converter. I've converted all the Part 1 lectures, and am in the process of converting the Intro to ML lectures. I'd be happy to convert all the old classes to podcast and host them somewhere if @jeremy is okay with us converting them to mp3. Of course I won't do this for Part 2 until it is officially released. Let me know if this would be okay.

Sure feel free. BTW you can download just the audio with youtube-dl.

@MadeUpMasters I wanted to know if you are hosting these lecture mp3s somewhere we can get access to them?

Hey, sorry I slacked on this, they exceeded the filesize on github and I was waiting until I'd set up a personal website to host. If you know a good place to share them freely please let me know and I'll upload them.

1 Like

No problem. What range of size are we talking?

1 Like

You can upload the files to drive / dropbox & share the links in github.

1 Like

Hey guys, sorry for any delay on this. I've gone the google drive + GitHub route. The Part 1 2019 audio is up. I'm going to add the intro to ML and Part 2 2018. I'll add Part 2 2019 upon request (or accept PRs from others who do it).

So far, having listened to a few of the lessons for Part 1 in audio-only mode, I have to say I'm not a huge fan of listening podcast style. There are definitely certain parts that do really well as audio (general advice, talking about specifics like heuristics for tuning learning rates, etc.) but for the most part it doesn't seem like a really effective use of time. The argument from most people would be that you can do it while doing other stuff, but I personally prefer going really hard at learning for a fixed period daily and then letting it go to do stuff away from the computer and then come back refreshed. To each their own, so if this method helps you then the repo will be there!

Edit: Oops, forgot the link! Fastai Lesson Audio (mp3)

6 Likes

Something that has helped me with fastai has been to re-implement each notebook and take it week by week instead of rushing to complete the course. I treat fastai as a marathon, not a sprint, and make sure that I finish and finish strong.
I also write notes for myself, research things that are hard to understand, and spend a lot of time writing code and digging into the fastai source code.

Now that part 2 is out, my focus is going to be this approach of doing one lesson per week or less, but doing it really well. Doing the homework and following the public MOOC as though I am in class also really helps me. Fastai is overwhelming in terms of knowledge, and there is so much to learn that if we hopped from one lesson to the next without first spending time on each lecture, it would be really hard to carry that knowledge forward.

3 Likes

To study part 2 of 2019, is there a need for me to study part 2 of 2018 first? Please, I need your response to guide my way through. Thanks

Yes and no. It was built on an older version of the library, so some things may not port over well, but what Jeremy teaches there is still relevant today. Eventually we will get some more narrowed down courses (like the NLP one happening now, and a GAN one in the works) but for now it is an excellent course to go through to understand the advanced topics.

Thanks. I have studied part one. Can I move on to this new part 2?

1 Like

Absolutely! Go on!

1 Like

Hello!
I know this is quite subjective and depends on each person's preferences, but have you seen better results taking notes on the computer, or by hand and then transferring them to the computer? If you are using the computer, do you use notebooks or google docs/evernote?

I'm one of those people who carries a (paper) notebook in my pocket at all times and scribbles stuff in it, but I think I never go back and read the notes or do anything with them (I really need to get a system ahaha). I'm afraid this will happen with my course notes.

1 Like