Is download_images function broken?

nlim · November 7, 2018, 8:30pm

I’m running it on my local machine

lesscomfortable · November 7, 2018, 9:38pm

Is it able to download a few images? Or does it break before printing anything?

ptah · November 7, 2018, 9:42pm

I get the same problem and I just updated it again. I am running on my local machine

NameError Traceback (most recent call last)
in
----> 1 download_images(path/file, dest, max_pics=200)

NameError: name ‘download_images’ is not defined

ptah · November 7, 2018, 9:45pm

anaconda-navigator shows i am on 1.0.4, with no option to update. conda update says it is the newest version too

ptah · November 7, 2018, 10:08pm

I am installing now with conda install -c fastai fastai=1.0.20 to force new version

ptah · November 7, 2018, 11:03pm

now i’m getting

ModuleNotFoundError Traceback (most recent call last)
in ()
----> 1 from fastai import *
2 from fastai.vision import *

ModuleNotFoundError: No module named ‘fastai’

nkrasnov · November 8, 2018, 12:04am

In lesson_2 download notebook there was no lines for autoupdate packages. If you’ve run your notebook and then understood that fastai library was out of date I think these lines may help you… Anyway it helps me
%reload_ext autoreload
%autoreload 2
%matplotlib inline

ptah · November 8, 2018, 7:41pm

I got it working now, i recreated fast.ai environment and installed jupyter into it too

Mauro · November 13, 2018, 11:29am

@lesscomfortable is there a way to modify this function so that it can download more than 400 images?

lesscomfortable · November 13, 2018, 7:05pm

If you scroll down in Google until you can’t anymore and then run the script you should have ~700 pictures. Google does not allow more images to be displayed. If you want more images, you can run the script on one Google Search and then run it again on a related one (e.g. searching in two languages like ‘tiger’/‘tigre’ or searching related terms like ‘dish’/‘plate’).

Mauro · November 13, 2018, 7:54pm

It’s not Google. The maximum the function downloads is 400 images even if you set it to 700 images and use 700 urls.

lesscomfortable · November 13, 2018, 8:01pm

Does it? Are you sure you have >400 urls? Did you set max_pics to 400?

Mauro · November 13, 2018, 8:53pm

You’re right. It must have been my number of urls. I just downloaded 900 images.

Borz · December 4, 2018, 5:09am

Adding here since it’s related to download_images: (fastai 1.0.32)

Looks like if you can’t write to a dictionary if you use more than one thread/process (max_workers > 1).

I was trying to save filename:url mappings to a dictionary so I could easily remove urls after deleting images in my file explorer. I made up an ImageDownloader class and threw download_images, _download_image_inner and download_image in there, then editted it a bit to keep track of what was downloaded with a dictionary.

Nothing gets written to the dictionary if max_workers is greater than 1, although within the scope of download_image, everything does get written. I looked around and found this stackoverflow thread, so I guess this is normal. If max_workers is -1, 0, or 1 then the dictionary does get populated with filenames and urls.

edit: as a note, if you want to keep track of the filename – url mapping when using download_images and run at full speed with multiple processes, you can just modify download_image to print out the filename and url upon a successful download.

karanbangia14 · February 10, 2019, 11:28am

The download_image function is just replacing the previous images in the destination …What can i do??

jelmoodjasser · July 12, 2019, 6:24am

In my case I’ve just added:
from fastai.vision.data import *
This has solved the problem for me.
I hope it helps.

lightning · November 4, 2019, 4:56am

Yes each time you open the jupyter notebook - need to re-import the library and subsequent commands

Saumil · March 27, 2020, 3:04pm

Hi. I am facing the same issue. While downloading the images from different url lists the previous image in the destination get overwritten.
Have you found out the solution for this problem?

Edit: I found a work around this problem by concatenating the urls in a single list and then using the function download_images

ryanwtyler · December 29, 2020, 4:54pm

I got this error:
“NameError: name ‘download_images’ is not defined”

running the lesson2-download notebook on my machine.

fastai version 2.1.10

I have tried from fastai.vision.data import * and

%reload_ext autoreload
%autoreload 2
%matplotlib inline

any help would be appreciated.

PS i apologize if i have asked in the wrong thread or made some other noob mistake.

edit:

from fastai.vision.utils import * fixed it but now i get:
IsADirectoryError: [Errno 21] Is a directory: 'bears/grizzly' after running:

download_images(path/file, dest, max_pics=200)

muellerzr · December 29, 2020, 6:21pm

The new version of fastai uses a .all module when importing like that. So you should try:

from fastai.vision.data.all import *

Although in this case you may be better off doing:

from fastai.vision.all import *