Generating image datasets quickly


(Theodoros Galanos) #1

Hello everyone!

I was going to start my second pass on the fastai course yesterday and wanted to follow through the courses with my own case studies this time. While that hasn’t been difficult for structured data (my work involves a lot of that) or language models (shockingly there’s a lot of text out there for all of us), it has been a bit trickier for image datasets.

Yesterday I found a really easy way of going about that. It was all possible with the google images downloader firefox addon: https://addons.mozilla.org/en-US/firefox/addon/google-images-downloader/?src=recommended

This amazing little addon allows you to download ALL images on a google image search with the press of a button, even providing a text file including all image links. I know there’s probably dozens of ways to scrape images off the net, but this one was the most satisfying I’ve found to date and wanted to share.

As an example, in a matter of 20 mins I had 12,500 images of different architectural practices (around 700 images of buildings from ~20 firms) for my ‘architect studio classification’ project.

Hope this is useful to some of you!

Kind regards,
Theodore.


(Sudeep) #2

Thanks! I tried this extension and it seems to work, but it only downloaded around 80 images for the search term I had. Any suggestions on how to get it to grab more results for the same search term?


(chandan) #3

this has been super helpfull, thanks a lot and kudos to dev of add on.:vulcan_salute:


(chandan) #4

just scroll to end of images and preview 1000-2000 images before hitting image button.it will download all of em.