SBU Captions

The SBU Captions dataset is a large-scale image-text dataset used for vision-language pre-training.

Data and Resources

Original MetadataJSON
The json representation of the dataset with its distributions based on DCAT.
Explore
- Preview
- Download

Sunan He, Taian Guo, Tao Dai, Ruizhi Qiao, Chen Wu, Xiujun Shu, Bo Ren (2024). Dataset: SBU Captions. https://doi.org/10.57702/hc8227lz

DOI retrieved: December 2, 2024

Field	Value
Created	December 2, 2024
Last update	December 2, 2024
Defined In	https://doi.org/10.48550/arXiv.2208.04060
Citation	https://doi.org/10.48550/arXiv.2208.09374 https://doi.org/10.48550/arXiv.2310.19654 https://doi.org/10.48550/arXiv.2211.15398
Author	Sunan He
More Authors	Taian Guo Tao Dai Ruizhi Qiao Chen Wu Xiujun Shu Bo Ren
Homepage	https://ai.stanford.edu/~aishwarya/data/sbu_captions/