Share your work here ✅

(Jeremy Howard (Admin)) #1

Show us what you’ve created with what you learned in! :slight_smile: It could be a blog post, a jupyter notebook, a picture, a github repo, a web app, or anything else. Some tips:

  • Probably the easiest way to blog is on Medium. If you use Medium, make sure you add your twitter username to your Medium profile, so that sharing will automatically credit you
  • The easiest way to share a notebook on github is to install the gist it extension. This will only be possible if you use a platform that supports jupyter extensions, such as GCP. Otherwise, you can create a notebook gist by clicking File->Download to get your notebook to your computer, and then follow the steps from this SO post:
    1. Go to
    2. Click ‘New Gist’ on the upper right corner
    3. Open the folder in a Finder/Explorer window on your local computer
    4. Drag the file into the text box (the ‘code space’). This should fill the space with JSON looking text for the
      framework of the notebook content.
    5. Copy/Paste the full file name (e.g., mynotebook.ipynb) into the filename box, and give a description above.
    6. Create the Gist!
  • If you want to have folks on the forum look at a draft and give feedback without sharing it more widely, just mention that in your post
  • You can also just use a reply to this topic to describe what you did - preferably pasting in a picture or two! :slight_smile:

Free GPU-Google --> Lesson 1 -HW- Alien vs. Predator images
Classifying Tesla cars
FAQ, resources, and official course updates ✅
Request: resource to writing a first blog post
Time series/ sequential data study group
Time series/ sequential data study group
Developer chat
Lesson 1 Discussion ✅
Getting to the bottom of the posts

I put together a Kaggle competition starter pack for a competition that is currently under way - Quick, Draw! There are still six more weeks before it finishes so that is plenty of time to play around.

You can find the starter pack on github here.

The competition has a lot of good things going for it. First of all, there do not seem to be any issues with the data. It is also one of the unusual competitions where you can let your imagination run wild - many, many different models can be useful here, potentially including RNNs. Given the submission format it should also be really easy to come up with some interesting way to ensemble the data - might be a fun thing to work on in a group.

If you have any starter code or competition related questions - please go to the thread on Kaggle forums and let’s please discuss it there. This way others who might have a similar question could also benefit. Same goes for sharing code - if there is any competition related code you would like to share, please do it in the open - sharing privately is against competition rules.

Enjoy and beware - these competitions can be addictive :wink:

PS. I have not made any references to class materials as to not share links before MOOC launch. The training notebook is based line for line on the lesson 1 lecture nb. Hope Jeremy will forgive me for the lack of proper attribution this time around.

(Jeremy Howard (Admin)) #3

Thanks for mentioning that! :slight_smile: I think it would be nice to say “this code is based on code from a MOOC that will be publicly available in Jan 2019” or something.

(Maria) #4

Is getting 99,9% accuracy on MNIST any good?
I’ve played a bit with first notebook and got that after some experiments accuracy. From what I’ve found this seems to be better than paper results:


Hi everybody!

While watching lesson 1 of the new course I was wondering where to get a big forest/nature related dataset to build an image classifier on, as this is the domain I am coming from.

I finally found the ImageCLEF Plant Identification Challenge 2013 which provides a already labeld training dataset containing images of 250 plant species on 10485 images (25GB). Most of the images are showing leafs but there also images of flowers, fruit, stem & the entire plant.

For the classifier I used the images with a uniform background (category=SheetAsBAckground) which only contain leaves: 4921 samples and 124 classes.

I started with training a pretrained resnet34 and already got the error rate down to ~3% after 17 epochs. Interestingly fine tuning didn´t help to improve accuracy/loss drastically.

What to do next:

  • Train network for category=NaturalBackground
  • Maybe exclude classes with samples < 10?

Below you can find a GIST of the notebook I used. I am looking forward to your feedback on what I can improve or what else could be done with this dataset :slight_smile:


Transfer Learning with pretrained fastai model
(Ilia) #6

Note that Quick Draw competition goes in two versions: (1) simplified and (2) raw. And the later one includes approx. 66GB of data. So in case if you’re downloading the data, you’d better specify which version of the dataset you’re going to use:

kaggle competitions download quickdraw-doodle-recognition -p ~/data/doodle -f test_simplified.csv
kaggle competitions download quickdraw-doodle-recognition -p ~/data/doodle -f

(Brian Muhia) #7

I’ve made a notebook to classify food pictures from the food-101 dataset. Here’s the notebook in nbviewer, and here’s the repo if you want to see the other experiments I’ve been running. I built a French language model using v1, and also have another experiment with food-101.

(Benoit) #8

Hello I’ve made a Notebook to classify airplaines, more info here :

(Keyur Paralkar) #9

I have written a synopsis of how to approach a given problem i.e. dataset and to get the end results. This is blog is at preliminary stage so suggestions are highly appreciated.

(Brian Smith) #10

I’ll do a broader write up on my blog as I progress - but my first attempt at differentiating downloaded images from Google (thanks @lindyrock for the tip) of American football, soccer, rugby, Australian rules football, ice hockey, field hockey, lacrosse, baseball, cricket, boxing, wrestling and athletics gave some fun results. A few hundred images of each, ResNet34 and just 4 cycles gave me 20% error rate. Initial observations - my data isn’t that clean - some search results weren’t what I was looking for (soccer/football - ice and field hockey) - so reviewing and cleaning datasets will help. Confusion matrix much as expected -

with the hockeys, wrestling/boxing and football/soccer most confused. Surprisingly rugby and Australian rules not as bad. A fun start!

(Fadhli Ismail) #11

Was looking at this competitions the other day. Thanks for sharing your work.

(Kai Lichtenberg) #12

I wrote a fun little notebook to create an image classifier just with specifying keywords. It’s using the google image search to find images for those keywords and trains a classifier on those images:

Classify images with Keywords

I tried to advance the emerging field of classifying Hot Dogs by specifically adding “Corn Dogs” to the portfolio.

Maybe it’s time for the ImageDataBunch.from_google Method :joy:

(Kaushik Jaiswal) #13

hey how did you added the dataset .

(Kaushik Jaiswal) #14

in line 3 how did you add the food data . like in Lesson-1 notebook , jeremy has defined a url constant for pet dataset .

(Brian Muhia) #15

I had downloaded the data separately, but I’ll update the notebook with the constant.

Edit: Done

(Kaushik Jaiswal) #16

have you put the data in a separate folder and just passed the path of that folder in line 3?

Image Similarity Search
(ritika) #17

@radek .Thank you for sharing your work. It is very helpful.

(Brian Muhia) #18

That folder is where I had downloaded it anyway, so the result will be the same, if my reading of the code is right.

(Likhit) #19

I really needed the script for removing corrupted images(pynoob :smile: ). Thanks!

(Kai Lichtenberg) #20

You’re welcome :grin: