You're currently viewing an old version of this dataset. To see the current version, click here.

SBU Captions

The SBU Captions dataset is a large-scale image-text dataset used for vision-language pre-training.

Data and Resources

This dataset has no data

Sunan He, Taian Guo, Tao Dai, Ruizhi Qiao, Chen Wu, Xiujun Shu, Bo Ren (2024). Dataset: SBU Captions. https://doi.org/10.57702/hc8227lz

Private DOI This DOI is not yet resolvable.
It is available for use in manuscripts, and will be published when the Dataset is made public.

Field	Value
Created	December 2, 2024
Last update	December 2, 2024
Defined In	https://doi.org/10.48550/arXiv.2208.04060
Citation	https://doi.org/10.48550/arXiv.2208.09374 https://doi.org/10.48550/arXiv.2310.19654 https://doi.org/10.48550/arXiv.2211.15398
Author	Sunan He
More Authors	Taian Guo Tao Dai Ruizhi Qiao Chen Wu Xiujun Shu Bo Ren
Homepage	https://ai.stanford.edu/~aishwarya/data/sbu_captions/