Photorealistic text-to-image diffusion models with deep language understanding

doi:doi:10.57702/y1fq9iuk

Photorealistic text-to-image diffusion models with deep language understanding

The authors present a photorealistic text-to-image diffusion model with deep language understanding.

Data and Resources

Original MetadataJSON
The json representation of the dataset with its distributions based on DCAT.
Explore
- Preview
- Download

Cite this as

Chitwan Saharia, William Chan, Saurabh Saxena, Lala Li, Jay Whang, Emily L Denton, Kamyar Ghasemipour, Raphael Gontijo Lopes, Burcu Karagol Ayan, Tim Salimans (2024). Dataset: Photorealistic text-to-image diffusion models with deep language understanding. https://doi.org/10.57702/y1fq9iuk

DOI retrieved: December 2, 2024

Additional Info

Field	Value
Created	December 2, 2024
Last update	December 2, 2024
Defined In	https://doi.org/10.48550/arXiv.2210.12100
Citation	https://doi.org/10.48550/arXiv.2206.00169 https://doi.org/10.48550/arXiv.2312.02133
Author	Chitwan Saharia
More Authors	William Chan Saurabh Saxena Lala Li Jay Whang Emily L Denton Kamyar Ghasemipour Raphael Gontijo Lopes Burcu Karagol Ayan Tim Salimans
Homepage	https://huggingface.co/stabilityai/stable-diffusion-2-1-unclip