I'm looking to build, and then share, a recommendation system on this forum using the collaborative filtering tools available in fastai. First, though, I'm seeking your help to collect the necessary data.
The purpose of the recommendation system is to recommend travel destinations based on individual user profiles. The drive to develop such a model stems from the fact that my fiancée and I are getting married at the end of the year, and we're unsure where we'd like to go for our honeymoon. The model will therefore produce a list of recommended countries based on our individual user profiles.
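For anyone curious what the collaborative filtering approach boils down to here: fastai's collab tools learn an embedding per user and per item and score user–item pairs with a dot product. Here's a rough sketch of that idea in plain numpy, on a toy ratings matrix (all numbers below are invented for illustration; fastai's `collab_learner` adds biases, regularization, and a proper training loop on top of this):

```python
import numpy as np

# Toy ratings matrix: rows = users, cols = travel destinations,
# 0 means "not rated yet". All values are made up for illustration.
ratings = np.array([
    [5, 3, 0, 1],
    [4, 0, 0, 1],
    [1, 1, 0, 5],
    [0, 1, 5, 4],
], dtype=float)

n_users, n_items = ratings.shape
n_factors = 2
rng = np.random.default_rng(42)
U = rng.normal(scale=0.1, size=(n_users, n_factors))  # user embeddings
V = rng.normal(scale=0.1, size=(n_items, n_factors))  # destination embeddings

mask = ratings > 0  # only fit on observed ratings
lr = 0.05
for _ in range(2000):
    err = (U @ V.T - ratings) * mask   # error on observed entries only
    U -= lr * (err @ V)                # gradient step on user factors
    V -= lr * (err.T @ U)              # gradient step on item factors

# Predicted score for user 0 on the destination they haven't rated (index 2)
print(round((U @ V.T)[0, 2], 2))
```

Once the embeddings are trained, ranking the unrated destinations by predicted score gives the recommendation list.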
My kid and I keep surfing the internet for various dinosaurs and their features, habitats, and nesting patterns.
He also builds dinos from his Lego and building blocks, and occasionally draws them.
Recently he's been working on building a Jurassic world with all his toys.
I thought it would be fun to build a prehistoric dinosaur identifier for this week's homework for him to play around with.
I used an SE-ResNeXt50 model and fine-tuned it for 20 epochs, reaching an error rate of 0.248869. A blog post with a more detailed analysis is under construction.
Had to look up what that is. That's an interesting idea. No idea how I'd implement it with fastai, however. From what I remember of the course and the library, there are some helper functions built into fastai that will at least let me find which examples the model found hardest to classify. That might be a start in this general direction of looking under the hood.
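For reference, the helper I was thinking of is fastai's `ClassificationInterpretation` with `plot_top_losses`, which ranks examples by their individual loss. The underlying idea is simple enough to sketch in plain numpy (the probabilities and labels below are made up):

```python
import numpy as np

def top_losses(probs, labels, k=3):
    """Return indices of the k examples with the highest cross-entropy loss,
    hardest first. This mirrors the idea behind fastai's plot_top_losses."""
    probs = np.asarray(probs, dtype=float)
    labels = np.asarray(labels)
    # per-example negative log-likelihood of the true class
    losses = -np.log(probs[np.arange(len(labels)), labels] + 1e-12)
    order = np.argsort(losses)[::-1]
    return order[:k], losses[order[:k]]

# Made-up predicted probabilities for 4 examples over 3 classes
probs = [[0.70, 0.20, 0.10],
         [0.10, 0.80, 0.10],
         [0.05, 0.05, 0.90],
         [0.30, 0.30, 0.40]]
labels = [0, 1, 0, 2]   # example 2 is badly wrong: true class 0, prob 0.05
idx, losses = top_losses(probs, labels, k=2)
print(idx)  # hardest examples first
```

With a trained fastai learner the equivalent is `interp = ClassificationInterpretation.from_learner(learn)` followed by `interp.plot_top_losses(9)`.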
I trained a model to detect emotions in a Kaggle notebook. Getting good images of emotions through DDG search proved tricky, so I used an existing dataset: a sample of AffectNet with 500 images per class. This data is also pretty noisy, but resnet34 reached an error rate of around 30% across the 5 classes.
Inspired by the JavaScript Interface thread, I made a little JavaScript adventure game that uses the emotion classifier to move between rooms. I pushed it to GitHub and deployed it on GitHub Pages.
The game could use a lot of polish, especially around making it easier to capture images from a camera and resizing them before uploading. Currently it requires having some images with facial expressions downloaded and uploading them from a file browser, but it works reasonably well.
Well, that was a journey! I have relatively little idea of what all this means (chapter 18!), but at least the model does seem to be activating (if that's the right term) on Mr Blupus and not on the pillow or the background.
Thanks for the nudge to try that out. I would never have dared otherwise, and it gives me a bit of extra motivation to do the work to get to the point where I understand all the layers of what's going on here.
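For anyone wanting the gist of what the chapter 18 technique (class activation maps) computes: weight the last conv layer's feature maps by the final-layer weights for a class, sum over channels, and you get a heatmap of where the model "looked" for that class. A rough numpy sketch with fake activations (the real thing pulls these tensors out of the network with hooks):

```python
import numpy as np

def class_activation_map(feature_maps, class_weights):
    """Weight each spatial feature map by the classifier weight for one class.
    feature_maps:  (C, H, W) activations from the last conv layer
    class_weights: (C,) final linear-layer weights for the class of interest
    Returns an (H, W) heatmap of where the model 'looked'."""
    # sum over channels: cam[h, w] = sum_c w[c] * fmap[c, h, w]
    cam = np.tensordot(class_weights, feature_maps, axes=([0], [0]))
    cam -= cam.min()
    if cam.max() > 0:
        cam /= cam.max()   # normalise to [0, 1] for display
    return cam

rng = np.random.default_rng(0)
fmaps = rng.random((8, 7, 7))   # fake activations: 8 channels on a 7x7 grid
weights = rng.random(8)         # fake weights for the class of interest
heatmap = class_activation_map(fmaps, weights)
print(heatmap.shape)
```

Overlaying the upsampled heatmap on the input image is what produces those "the model is looking at the cat, not the pillow" pictures.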
You can check out the model here. Thanks to @suvash for explaining how to make Hugging Face Spaces work and walking through his code during the delft-fastai meetup.
Pretty neat that you have the audio → image (spectrogram) conversion right in the inference code, and I see that you're also using the Hugging Face + fastai integration functions.
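For anyone who hasn't seen the audio → spectrogram step before: it's typically a short-time Fourier transform, slicing the waveform into overlapping windows and taking the magnitude of each window's FFT. A minimal numpy sketch (the frame size and hop length here are arbitrary choices for illustration, not necessarily what this Space uses):

```python
import numpy as np

def spectrogram(signal, n_fft=256, hop=128):
    """Magnitude spectrogram via a short-time Fourier transform.
    A minimal stand-in for the audio -> image step in the inference code."""
    window = np.hanning(n_fft)          # taper each frame to reduce leakage
    frames = []
    for start in range(0, len(signal) - n_fft + 1, hop):
        frame = signal[start:start + n_fft] * window
        frames.append(np.abs(np.fft.rfft(frame)))
    return np.array(frames).T           # (freq_bins, time_steps)

# 1 second of a 440 Hz tone sampled at 8 kHz
sr = 8000
t = np.arange(sr) / sr
spec = spectrogram(np.sin(2 * np.pi * 440 * t))
print(spec.shape)
```

The resulting 2-D array can then be rendered as an image and fed to a vision model exactly like any other picture.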
Did you include this in your notebook? If so … please do share!
It's kind of amazing that this is considered an advanced technique that waits for folks at the end of the book … and yet you figured out how to implement it after the first session of the course. Top-down learning works!
Thanks @suvash for the inspiration to explore Gradio and Huggingface Spaces.
Thanks @strickvl for hosting the delft-fastai study group, and good work using class activation maps to check how your model is working; I will definitely have to give this a go as well!
I took an idea I had while tinkering with a smart home around "private" computer vision: can I use images with detail stripped out and still build a model that can drive certain smart home tasks (e.g. lights on/off when someone enters/exits a room)? With limited time and pretty much the standard set of hyperparameters that fastai suggests, I was able to get 85–90% accuracy in a simple multi-class classification using two different types of proxy camera filters.
The background and next steps for anyone interested can be found in this notebook, but the lesson here is that even models that are clearly not optimized for this task can still get good results in a couple of hours.
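As an illustration of what a detail-stripping "proxy camera" filter could look like (a hypothetical stand-in, not necessarily the filters used in the notebook): aggressively average the image over coarse blocks, so individual features become unrecoverable while coarse occupancy information survives:

```python
import numpy as np

def privacy_filter(frame, block=16):
    """Strip detail from a grayscale image by averaging over coarse blocks.
    One plausible 'proxy camera' filter; the notebook's actual filters
    may differ. frame: (H, W) array."""
    h, w = frame.shape
    h2, w2 = h - h % block, w - w % block          # crop to a multiple of block
    blocks = frame[:h2, :w2].reshape(h2 // block, block, w2 // block, block)
    return blocks.mean(axis=(1, 3))                # heavily pixelated image

frame = np.random.default_rng(1).random((64, 64))  # stand-in camera frame
coarse = privacy_filter(frame)
print(coarse.shape)  # detail reduced from 64x64 to 4x4
```

A classifier trained on frames like `coarse` never sees identifiable detail, which is the point of the "private" computer vision idea above.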
This fastai/DL stuff is like magic - but unlike magicians, the fastai team actually reveal their secrets
BTW - I tried to host my notebook on Kaggle but encountered an error. I'll repost there if I can resolve it.
Yeah, I figured out the pipeline by looking at notebooks made by @dhoa and playing around with Gradio audio inputs. It worked pretty smoothly with the integration.
One thing I noticed, though, was that two of the three Gradio demos showing the audio feature were having some issues.