Dataset Groups Activity Stream Conceptual Captions 3M, Conceptual Captions 12M, RedCaps, and LAION-400M The dataset used in the paper is Conceptual Captions 3M (CC3M), Conceptual Captions 12M (CC12M), RedCaps, and LAION-400M. BibTex: @dataset{Lijie_Fan_and_Dilip_Krishnan_and_Phillip_Isola_and_Dina_Katabi_and_Yonglong_Tian_2024, abstract = {The dataset used in the paper is Conceptual Captions 3M (CC3M), Conceptual Captions 12M (CC12M), RedCaps, and LAION-400M.}, author = {Lijie Fan and Dilip Krishnan and Phillip Isola and Dina Katabi and Yonglong Tian}, doi = {10.57702/gb9yyxtt}, institution = {No Organization}, keyword = {'dataset', 'image captioning', 'image-text pairs', 'vision-language models'}, month = {dec}, publisher = {TIB}, title = {Conceptual Captions 3M, Conceptual Captions 12M, RedCaps, and LAION-400M}, url = {https://service.tib.eu/ldmservice/dataset/conceptual-captions-3m--conceptual-captions-12m--redcaps--and-laion-400m}, year = {2024} }