Stable diffusion: resources and discussion

jeremy · September 26, 2022, 1:29am

Got questions, comments, links, or want to chat about Stable Diffusion? Do it here! Here’s some links to help get you started:

Review of latest Score Based Generative Modeling papers.
labml.ai Annotated PyTorch Paper Implementations
Stable Diffusion with Diffusers
Huggingface noteboooks
Simple diffusion from @johnowhitaker
Introduction to Diffusion Models for Machine Learning - AssemblyAI
Tutorial - What is a variational autoencoder?
“Grokking Stable Diffusion” from @johnowhitaker
Grokking SD Part 2: Textual Inversion from @johnowhitaker
What are Diffusion Models? · Lilian Weng
Generative Modeling by Estimating Gradients of the Data Distribution (Yang Song)
The Annotated Diffusion Model
Understanding VQ-VAE (DALL-E Explained Pt. 1)
Diffusers Interpret. Model explainability, could be adapted to show some nice instructive plots.

nain · September 26, 2022, 6:38am

Shameless plug: Diffusion Models [Aakash Nain, Sayak Paul, Rishabh]

jeremy · September 26, 2022, 6:57am

Here’s something cool that @yiyimarz did too:

yiyimarz · September 26, 2022, 7:07pm

Thanks for mentioning my project here! I feel so honored:) I’ve learned so much from fast.ai courses and am super excited to join this upcoming class!

jamesrequa · September 27, 2022, 5:38am

Concepts like textual inversion or dreambooth to me are possibly the most exciting and powerful extensions to stable diffusion. Being able to inject custom representations (with only 3-5 images!) into the text-to-image model and optimizing towards seemingly any novel concept provides incredible control over content generation.

RogerS49 · September 27, 2022, 6:27am

This subject is really interesting, given the vast amount of links here and within links to further papers/ pages perhaps a score process could be defined to indicate the ones that explain the subject best in the simplest terms, for me so far I give a vote for ‘Lilian Weng’, the ‘Yang Song’ intrigues me by the title only so far so next on my reading list.

After reading Yang Song paper I give that a vote also, note the first link in Jeremy’s original post is from that paper. In this paper Yang Song writes a commentary on the connections to different models in this sphere of knowledge.

I must comment that I am new to this topic and my suggestions are only that. Each link above reveals the number of it’s access clicks but does not say how useful the experience has been to the reader.

wyquek · September 27, 2022, 12:33pm

I got interested two weeks ago after I chanced upon @ilovescience 's youtube Diffusion Study Group #1 - EleutherAI ; so glad this is happening

miko · September 27, 2022, 2:00pm

I found this twitter thread particularly useful at giving me a good high level view of SD, and particularly around the “latent diffusion” idea, together with this other which is linked in the original one.

It gave me a good idea of the pieces involved and a basis for what the endgame is

tcapelle · September 29, 2022, 9:18am

@jeremy this could be useful to your course —> This is a very minimal implementation in PyTorch, I learned a lot watching this video: Diffusion Models | PyTorch Implementation - YouTube
I would also add the Keras implementation, it is super concise and clear:
GitHub - divamgupta/stable-diffusion-tensorflow: Stable Diffusion in TensorFlow / Keras

Is someone implementing SD training/finetunning on fastai?

wyquek · September 29, 2022, 10:10am

I like Outlier’s videos a lot; the previous one on diffusion was very good too, but requires a lot of let-me-rewind-20s-and-listen-to-that-again.
Going to watch this new one, thanks!

harikrishnanrajeev · September 29, 2022, 11:07am

harikrishnanrajeev · September 29, 2022, 11:07am

jamesrequa · September 29, 2022, 9:10pm

Just wanted to highlight a few cool new diffusion model techniques released this week!

MAKE-A-VIDEO: TEXT-TO-VIDEO GENERATION WITHOUT TEXT-VIDEO DATA
paper: https://makeavideo.studio/Make-A-Video.pdf
project: https://makeavideo.studio/

TRAINING-FREE STRUCTURED DIFFUSION GUIDANCE FOR COMPOSITIONAL TEXT-TO-IMAGE SYNTHESIS
paper: https://openreview.net/pdf?id=PUIqjT4rzq7
key idea: Improve SD prompt-adherence using cross-attention

DREAMFUSION: TEXT-TO-3D USING 2D DIFFUSION
paper: https://openreview.net/pdf?id=FjNys5c7VyY
project: https://dreamfusionpaper.github.io/
key idea: diffusion model for text to 3d

Even · September 30, 2022, 1:38am

I thought this article about using it for compression was super interesting as well!

yiyimarz · October 2, 2022, 4:39am

How do you finetune SD? use dreambooth?

jamesrequa · October 2, 2022, 5:34am

After seeing this tweet by Tanishq I knew something had to be done…

Using the HF SD Dreambooth training collab I added Jeremy as a new concept to SD. It took just 5 images of Jeremy that I found online, < 5 mins of total training time on a V100, and < 10s per image to generate these.

jeremy_rendering

sachinruk · October 2, 2022, 10:22pm

Just wondering are these pre-requisites for the course? Or simply just posting some links ahead of time?

jeremy · October 2, 2022, 10:53pm

simply just posting some links ahead of time

s.s.o · October 4, 2022, 6:45am

TabDDPM: Modelling Tabular Data with Diffusion Models

melonkernel · October 5, 2022, 5:05am

This link does not seem to work

Here is a snapshot from 27 september