You're currently viewing an old version of this dataset. To see the current version, click here.

SceneDiffusion Dataset

The dataset used in the paper is a large-scale dataset containing 1,000 text prompts and over 5,000 images associated with image captions, local descriptions, and mask annotations.

Data and Resources

Cite this as

Jiawei Ren, Mengmeng Xu, Jui-Chieh Wu, Ziwei Liu, Tao Xiang, Antoine Toisoul (2024). Dataset: SceneDiffusion Dataset. https://doi.org/10.57702/4hjo4gjx

DOI retrieved: December 2, 2024

Additional Info

Field Value
Created December 2, 2024
Last update December 2, 2024
Defined In https://doi.org/10.48550/arXiv.2404.07178
Author Jiawei Ren
More Authors
Mengmeng Xu
Jui-Chieh Wu
Ziwei Liu
Tao Xiang
Antoine Toisoul
Homepage https://arxiv.org/abs/2304.03373