Cite this as

Zhiliang Peng, Wenhui Wang, Li Dong, Yaru Hao, Shaohan Huang, Shuming Ma, Furu Wei (2024). Dataset: Kosmos-2: Grounding multimodal large language models to the world. Resource: Original Metadata. https://doi.org/10.57702/elgkymz1

DOI retrieved: December 3, 2024

Additional Information

Field Value
Created December 3, 2024
Last updated December 3, 2024
Format JSON