CC12M dataset

The CC12M (Conceptual 12M) dataset is used for training and testing the proposed method. It contains roughly 12 million image-caption pairs.

BibTeX: