How to get predictions for all images in a folder?

Hadus · March 23, 2020, 4:59pm

How can we get predictions for all images in a folder efficiently?

I have trained a model on MNIST and it is working pretty well.
I can use learn.predict to get a prediction on a single image.

I tried looping through the images in the folder and running learn.predict but it was way too slow:

files = !ls "mnist_data/test"
preds = []
for file in tqdm(files):
    number, n_th, probs = learn.predict(f"mnist_data/test/{file}")
    preds.append(n_th)

So instead I decided to gather the images into a numpy array:

files = !ls "mnist_data/test"
imgs = []
for file in tqdm(files):
    with Image.open(f"mnist_data/test/{file}") as img:
        imgs.append(np.array(img))

imgs = np.array(imgs) # shape: (28000, 28, 28)

The next step would be to make batches from imgs and get predictions for them. But I am not sure how to get the predictions for a batch (using the same stats transforms etc…).

learn.predict is definitely not the right function for this according to the docs.

muellerzr · March 23, 2020, 5:02pm

get_preds I think is what you’re wanting

Hadus · March 23, 2020, 5:16pm

Learner.get_preds
Get the predictions and targets on the ds_idx -th dbunchset or dl , optionally with_input and with_loss

Then the question becomes:
How do we make a DataLoader with the same stats as the one we trained on?

There should be an easy way to do this…

I make the DataLoaders like so:

mnist = DataBlock(
    blocks=(ImageBlock, CategoryBlock), 
    get_items=get_image_files, 
    splitter=RandomSplitter(valid_pct=0.2, seed=42),
    get_y=parent_label)

mnist = mnist.new(batch_tfms=(Warp(), Zoom(), Rotate()))
dls = mnist.dataloaders("mnist_data/train")

muellerzr · March 23, 2020, 5:26pm

We have a test_dl where you pass in the items you want to use as your test set.

For instance here you would do:

test_dl = dls.test_dl(get_image_files('mnist_data/train'))
(where dls is your original dataloader)

and you can then just do learn.get_preds(dl=test_dl)

Hadus · March 23, 2020, 5:27pm

That is exactly what I was looking for. Thanks so much

szhou41 · March 24, 2020, 12:42am

Here is my solution:

def load_imgs(path):
     image_files = []
     for file in os.listdir(path):
         if file.endswith('.jpg'):
             image_files.append(path+'/'+file)

     return image_files

test_path_lily = 'Data/test/1'

test_img_lily = load_imgs(test_path_lily)

uploader_lily = SimpleNamespace(data = test_img_lily)

def do_test(model, uploader, num):
     for i in range(num):
         img = PILImage.create(uploader.data[i])
         predict = model.predict(img)
         print(f"loop {i}, {uploader.data[i]}, {predict[0]}, {predict[2][0]}, {predict[2][1]}")

do_test(learn, uploader_lily, len(test_img_lily))

szhou41 · March 24, 2020, 12:44am

This is so elegant!

Hadus · March 24, 2020, 12:55am

The problem with this is that it takes more than a day to run on my data. While the way we are supposed to do it (as @muellerzr pointed it out) takes less than a minute.

szhou41 · March 24, 2020, 1:37am

Yes, my way takes very very long time.

faib · March 25, 2020, 2:00pm

learn.validate() outputs a list with two elements. I have not yet understood what those two numbers mean or when to use the function.
Can anyone explain this in more detail?

Hadus · March 25, 2020, 2:17pm

In the docs:

Return the calculated loss and the metrics of the current model on the given data loader dl . The default data loader dl is the validation dataloader.

So the first value is loss and the second is a metric.

You can check what metrics you have with:

str(learn.metrics)

faib · March 25, 2020, 2:22pm

Ah, I was looking at the new docs, which didn’t give me much information, thanks!

miwojc · March 25, 2020, 2:46pm

The only issue for me with learn.get_preds(dl=test_dl) is that it crops image to same format as train and valid set which is square. No big deal for classification but not good for segmentation where you want your full image size as output from prediction.

nchukaobah · April 8, 2020, 1:46am

This was very helpful. Thanks

cereal.runner · October 1, 2020, 4:55pm

Do you know how to get around this for segmentation?

florianl · October 1, 2020, 8:39pm

you can do the following to change the resize transform for prediction (thanks @muellerzr for helping me figure this out).

x=224
y=336

test_dl = dls.test_dl(get_image_files(‘mnist_data/train’))
# change the transforms
test_dl.after_item = Pipeline([Resize((x,y)),ToTensor])
test_dl.after_batch = Pipeline([IntToFloatTensor(),Normalize.from_stats(*imagenet_stats)])

leozitor · November 24, 2020, 4:24pm

The output of the preds is the tensor of the probabilities, is there a built in fuction that converts to classes? or it needs to be implemented by hands? because the learn.predict to given sample gives the class predicted, but using learn.get_preds on test set doesn’t, this is strange, something is missing?