Latent Diffusion Models: The Architecture behind Stable Diffusion
Last Updated on July 18, 2023 by Editorial Team
Author(s): Louis Bouchard
Originally published on Towards AI.
A High-Resolution Image Synthesis Architecture: Latent Diffusion
Originally published on louisbouchard.ai, read it 2 days before on my blog!
https://www.youtube.com/embed/RGBNdD3Wn-g
What do all recent super-powerful image models like DALLE, Imagen, or Midjourney have in common? Other than their high computing costs, huge training time, and shared hype, they are all based on the same mechanism: diffusion.
Diffusion models recently achieved state-of-the-art results for most image tasks, including text-to-image with DALLE but many other image generation-related tasks too, like image inpainting, style transfer, or image super-resolution.
There are a few downsides: they work sequentially on the whole image, meaning that both the training and inference times are expansive. This is why you… Read the full blog for free on Medium.
Join thousands of data leaders on the AI newsletter. Join over 80,000 subscribers and keep up to date with the latest developments in AI. From research to projects and ideas. If you are building an AI startup, an AI-related product, or a service, we invite you to consider becoming aΒ sponsor.
Published via Towards AI