WIT: Wikipedia-based image text dataset for multimodal multilingual machine learning
Data and Resources
-
Original MetadataJSON
The json representation of the dataset with its distributions based on DCAT.
Cite this as
Krishna Srinivasan, Karthik Raman, Jiecao Chen, Michael Bendersky, Marc Najork (2024). Dataset: WIT: Wikipedia-based image text dataset for multimodal multilingual machine learning. https://doi.org/10.57702/gjvr2zfc
DOI retrieved: December 3, 2024
Additional Info
Field | Value |
---|---|
Created | December 3, 2024 |
Last update | December 3, 2024 |
Defined In | https://doi.org/10.48550/arXiv.2301.10172 |
Author | Krishna Srinivasan |
More Authors |
|
Homepage | https://arxiv.org/abs/2103.01913 |