CC-3M

CC-3M is a large-scale dataset of 300,000 image-caption pairs.

BibTex: