
# CSE5519 Advances in Computer Vision (Topic D: 2021 and before: Image and Video Generation)

## High-Resolution Image Synthesis with Latent Diffusion Models

[link to the paper](https://openaccess.thecvf.com/content/CVPR2022/papers/Rombach_High-Resolution_Image_Synthesis_With_Latent_Diffusion_Models_CVPR_2022_paper.pdf)

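Latent diffusion models (LDMs) make high-resolution image synthesis tractable by running the diffusion process in the compressed latent space of a pretrained autoencoder instead of directly in pixel space.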

### Novelty in Latent Diffusion Models

#### Transformer encoder for LDMs

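The conditioning input (for example, a text prompt) is first encoded by a domain-specific encoder, which is a transformer in the text-to-image case. Cross-attention layers in the denoising U-Net then integrate the resulting text embedding into the latent representation: queries come from the U-Net's intermediate features, while keys and values come from the text embedding.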
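
To make the cross-attention step concrete, here is a minimal sketch in plain Python. It computes `softmax(Q K^T / sqrt(d)) V`, with `Q` projected from flattened latent features and `K`, `V` projected from text-token embeddings. The function names, the toy shapes, and the random projection matrices are all assumptions for illustration. A real LDM learns these projections, uses multiple attention heads, and runs on tensors inside the U-Net.

```python
import math
import random


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(a):
    """Swap rows and columns of a matrix."""
    return [list(col) for col in zip(*a)]


def softmax(row):
    """Numerically stable softmax over one list of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(latent, context, w_q, w_k, w_v):
    """Single-head cross-attention (illustrative, not the paper's code).

    latent:  (n_latent, d_latent)  flattened U-Net features -> queries
    context: (n_tokens, d_context) text embeddings          -> keys, values
    """
    q = matmul(latent, w_q)   # (n_latent, d)
    k = matmul(context, w_k)  # (n_tokens, d)
    v = matmul(context, w_v)  # (n_tokens, d)
    d = len(q[0])
    scores = matmul(q, transpose(k))  # (n_latent, n_tokens)
    weights = [softmax([s / math.sqrt(d) for s in row]) for row in scores]
    return matmul(weights, v), weights  # output: (n_latent, d)


# Toy example with hypothetical sizes and random (untrained) projections.
rng = random.Random(0)


def rand_matrix(rows, cols):
    return [[rng.uniform(-1, 1) for _ in range(cols)] for _ in range(rows)]


latent = rand_matrix(4, 3)   # 4 latent positions, 3 channels each
context = rand_matrix(2, 5)  # 2 text tokens, 5-dim embeddings
w_q, w_k, w_v = rand_matrix(3, 8), rand_matrix(5, 8), rand_matrix(5, 8)

out, weights = cross_attention(latent, context, w_q, w_k, w_v)
print(len(out), len(out[0]))  # 4 8
```

Each row of `weights` is a distribution over the text tokens, so every latent position receives its own mixture of the text values. This is how the prompt can influence different spatial regions differently.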