1 dataset found

Tags: Wikipedia Image Text

  • Wikipedia Image Text

    The Wikipedia Image Text (WIT) dataset is a large-scale multimodal dataset used to train and evaluate the MURAL model.
You can also access this registry using the API (see API Docs).