Hi Jeremy! I recently tried running the stable_diffusion.ipynb notebook locally (after a few days of struggling with CUDA setup). Running inference on the stable diffusion model gives a CUDA out of memory error. If anyone has any insights on how to solve it (I can't reduce the batch size here, can I?), that would be super helpful! Thanks.
I should also mention that the same code (image generation for “astronaut on a horse”) ran easily on a Paperspace free GPU instance, but didn’t work locally.
That card has 4 GB of VRAM IIRC. It’s a bit too little for Stable Diffusion, but perhaps you’d get lucky if you use pipe.enable_attention_slicing() after you create the pipeline. Could you try that out?
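Something like this is what I had in mind — a rough sketch using the Hugging Face diffusers pipeline (the model id and prompt are just examples, adjust to whatever the notebook uses):

```python
# Sketch: fp16 weights plus attention slicing is usually what gets SD close to 4 GB of VRAM.
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "CompVis/stable-diffusion-v1-4",
    torch_dtype=torch.float16,   # half precision roughly halves the memory for the weights
).to("cuda")

pipe.enable_attention_slicing()  # compute attention in slices instead of all at once

image = pipe("an astronaut riding a horse").images[0]
image.save("astronaut.png")
```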
That’s right, it has 4 GB, which I suspected was too low! Thanks for the advice, I’ll try that out. The thought process was, “Can I get the SD model to run inference locally, no matter how slowly?”. I’m just looking for ways to do that, much as we play with batch sizes and/or gradient accumulation in image classification models.
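In the “no matter how slowly” spirit, another option I might try (depending on the diffusers version, and assuming accelerate is installed) is offloading the model to CPU and only moving submodules to the GPU as they run. A rough, untested sketch:

```python
# Sketch: trade speed for memory by keeping weights on the CPU and streaming them to the GPU.
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "CompVis/stable-diffusion-v1-4", torch_dtype=torch.float16
)
pipe.enable_attention_slicing()
pipe.enable_sequential_cpu_offload()  # note: don't call .to("cuda") when using offload

image = pipe("an astronaut riding a horse", num_inference_steps=30).images[0]
```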
Probably not the right place, but I can’t think of a better thread right now (will move the discussion if needed). Random thought: it should in theory be possible to apply the VAE + UNet approach of working on latents to tasks like image segmentation as well, perhaps for faster results. Wondering what the group thinks about it.
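To make the idea concrete, here’s a rough sketch of the round trip through the VAE latent space (using diffusers’ AutoencoderKL; the 0.18215 scaling factor is the one SD uses, and the file name is just a placeholder). The point is that a downstream model could in principle operate on the 4×64×64 latents instead of 3×512×512 pixels:

```python
# Sketch: encode an image into SD's latent space and decode it back,
# to illustrate that downstream tasks could work on the much smaller latents.
import numpy as np
import torch
from diffusers import AutoencoderKL
from PIL import Image

vae = AutoencoderKL.from_pretrained("CompVis/stable-diffusion-v1-4", subfolder="vae").to("cuda")

img = Image.open("input.png").convert("RGB").resize((512, 512))
x = torch.from_numpy(np.array(img)).float() / 127.5 - 1.0   # scale pixels to [-1, 1]
x = x.permute(2, 0, 1).unsqueeze(0).to("cuda")               # shape (1, 3, 512, 512)

with torch.no_grad():
    latents = vae.encode(x).latent_dist.sample() * 0.18215   # shape (1, 4, 64, 64)
    recon = vae.decode(latents / 0.18215).sample             # back to pixel space
```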
From the ‘Progressive Distillation’ paper, showing quality (lower is better) at different numbers of sampling steps, comparing their distilled version with a non-distilled model sampled with DDIM. You can see the original models need more steps to reach decent quality.