SBU Captions

The SBU Captions dataset is a large-scale image-text dataset used for vision-language pre-training.

Data and Resources

Cite this as

Sunan He, Taian Guo, Tao Dai, Ruizhi Qiao, Chen Wu, Xiujun Shu, Bo Ren (2024). Dataset: SBU Captions. https://doi.org/10.57702/hc8227lz

DOI retrieved: December 2, 2024

Additional Info

Field Value
Created December 2, 2024
Last update December 2, 2024
Defined In https://doi.org/10.48550/arXiv.2208.04060
Citation
  • https://doi.org/10.48550/arXiv.2208.09374
  • https://doi.org/10.48550/arXiv.2310.19654
  • https://doi.org/10.48550/arXiv.2211.15398
Author Sunan He
More Authors
Taian Guo
Tao Dai
Ruizhi Qiao
Chen Wu
Xiujun Shu
Bo Ren
Homepage https://ai.stanford.edu/~aishwarya/data/sbu_captions/