15 lines
513 B
Markdown
15 lines
513 B
Markdown
# CSE5519 Advances in Computer Vision (Topic D: 2021 and before: Image and Video Generation)
|
|
|
|
## High-Resolution Image Synthesis with Latent Diffusion Models.
|
|
|
|
[link to the paper](https://openaccess.thecvf.com/content/CVPR2022/papers/Rombach_High-Resolution_Image_Synthesis_With_Latent_Diffusion_Models_CVPR_2022_paper.pdf)
|
|
|
|
Image synthesis in high resolution.
|
|
|
|
### Novelty in Latent Diffusion Models
|
|
|
|
#### Transformer encoder for LDMs
|
|
|
|
use cross-attention to integrate the text embedding into the latent space.
|
|
|