Lesson 9B: Math of Diffusion

ilovescience · October 13, 2022, 8:42pm

Here is an explanation directly from the lead author/developer of latent diffusion and Stable Diffusion:

We introduced the scale factor in the latent diffusion paper. The goal was to handle different latent spaces (from different autoencoders, which can be scaled quite differently than images) with similar noise schedules. The scale_factor ensures that the initial latent space on which the diffusion model is operating has approximately unit variance. Hope this helps.
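To make the "approximately unit variance" point concrete, here is a minimal sketch of the idea, not the authors' actual computation. It assumes the scale factor is simply the reciprocal of the standard deviation of the latents. The latents here are synthetic Gaussian values, and `raw_std` and `estimate_scale_factor` are made-up names for illustration; `raw_std = 5.49` was picked only because 1 / 5.49 is close to 0.182.

```python
import random
import statistics


def estimate_scale_factor(latents):
    """Return 1 / (population std) of a flat list of latent values."""
    return 1.0 / statistics.pstdev(latents)


# Synthetic stand-in for autoencoder latents (NOT real Stable Diffusion latents).
rng = random.Random(0)
raw_std = 5.49  # hypothetical spread, chosen so that 1 / raw_std is close to 0.182
latents = [rng.gauss(0.0, raw_std) for _ in range(50_000)]

scale_factor = estimate_scale_factor(latents)
scaled = [z * scale_factor for z in latents]

print(f"estimated scale factor: {scale_factor:.4f}")
print(f"std before scaling: {statistics.pstdev(latents):.3f}")
print(f"std after scaling:  {statistics.pstdev(scaled):.3f}")
```

After scaling, the values have standard deviation 1 by construction, so a noise schedule designed for unit-variance data can be reused across autoencoders whose raw latents are spread very differently.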

github.com/huggingface/diffusers

Explanation of the 0.18215 factor in textual_inversion?

opened 01:21AM - 09 Sep 22 UTC

closed 01:07PM - 09 Sep 22 UTC

garrett361

https://github.com/huggingface/diffusers/blob/b2b3b1a8ab83b020ecaf32f45de3ef2364…4331cf/examples/textual_inversion/textual_inversion.py#L501

Hi, just a small question about the quoted script above which is bothering me: where does this `0.18215` number come from? What computation is being done? Is it from some paper? I have seen the same factor elsewhere, too, without explanation. Any guidance would be very helpful, thanks!
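As for what computation is being done: the constant is multiplied into the latents right after encoding and divided back out right before decoding. The sketch below shows only that pattern. `toy_encode` and `toy_decode` are hypothetical stand-ins for a real autoencoder (here just a multiply and divide by 5), not the diffusers API.

```python
SCALE_FACTOR = 0.18215  # the constant asked about in the issue


def toy_encode(pixels):
    """Hypothetical stand-in for a VAE encoder: produces widely spread values."""
    return [p * 5.0 for p in pixels]


def toy_decode(latents):
    """Hypothetical inverse of toy_encode."""
    return [z / 5.0 for z in latents]


def to_diffusion_space(pixels):
    # Scale after encoding so the diffusion model works on rescaled latents.
    return [z * SCALE_FACTOR for z in toy_encode(pixels)]


def from_diffusion_space(latents):
    # Undo the scaling before handing the latents back to the decoder.
    return toy_decode([z / SCALE_FACTOR for z in latents])


pixels = [-1.0, -0.5, 0.0, 0.5, 1.0]
roundtrip = from_diffusion_space(to_diffusion_space(pixels))
print(roundtrip)
```

Because the multiply and the divide cancel, the factor changes only the range of values the diffusion model sees, not what the decoder ultimately receives.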