513 B
513 B
CSE5519 Advances in Computer Vision (Topic D: 2021 and before: Image and Video Generation)
High-Resolution Image Synthesis with Latent Diffusion Models.
Image synthesis in high resolution.
Novelty in Latent Diffusion Models
Transformer encoder for LDMs
use cross-attention to integrate the text embedding into the latent space.