get_preds() GPU utilization problem

Hey everyone,

We are experiencing a bottleneck with the get_preds() function while using a custom model for an image classification problem. In short, the GPU usage is too low.

Our workflow is the following:
import torch
from fastai.vision.all import load_learner, get_image_files

imgs = get_image_files(image_dir)
learn = load_learner(f'{modelname}.pkl')
dl = learn.dls.test_dl(imgs, bs=batch_size, device=torch.device('cuda'))
predictions = learn.get_preds(dl=dl)

VRAM usage increases with the batch size, but GPU utilization typically sits at around 1%. Any suggestions about why this might be the case?


Try passing cpu=False to load_learner. It defaults to cpu=True, which puts the model on the CPU, so inference runs on the CPU even though your test DataLoader is on the GPU; the VRAM you see growing is just the batches being staged there.
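For reference, the workflow from the original post with that fix applied would look roughly like this (image_dir, modelname, and batch_size are placeholders carried over from the question, so this is a sketch rather than a runnable script):

```python
import torch
from fastai.vision.all import load_learner, get_image_files

imgs = get_image_files(image_dir)
# cpu=False loads the model onto the GPU instead of the default CPU
learn = load_learner(f'{modelname}.pkl', cpu=False)
dl = learn.dls.test_dl(imgs, bs=batch_size, device=torch.device('cuda'))
predictions = learn.get_preds(dl=dl)
```

You can sanity-check where the model actually lives with next(learn.model.parameters()).device before calling get_preds.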


Oh, that did it, thanks! :slight_smile: