CC3M, SBU Captions, Visual Genome, and COCO

The dataset used in the paper is a combination of CC3M, SBU Captions, Visual Genome, and COCO.

Data and Resources

Cite this as

Jiho Jang, Chaerin Kong, Donghyeon Jeon, Seonhoon Kim, Nojun Kwak (2024). Dataset: CC3M, SBU Captions, Visual Genome, and COCO. https://doi.org/10.57702/n2m476ae

DOI retrieved: December 2, 2024

Additional Info

Field Value
Created December 2, 2024
Last update December 2, 2024
Defined In https://doi.org/10.48550/arXiv.2211.11153
Author Jiho Jang
More Authors
Chaerin Kong
Donghyeon Jeon
Seonhoon Kim
Nojun Kwak