VQA 2.0

The VQA 2.0 dataset is a benchmark for the visual question answering (VQA) task. It is divided into three splits: a training set with 83k images and 444k questions, a validation set with 41k images and 214k questions, and a test set with 81k images and 448k questions.
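
As a rough illustration of the split sizes quoted above, the following Python sketch records the rounded counts and derives the total size and the average number of questions per image. The names `SPLITS`, `questions_per_image`, and `totals` are hypothetical helpers introduced only for this example; they are not part of any official VQA tooling, and the figures are the rounded values from the description, not exact counts.

```python
# Rounded split sizes taken from the dataset description (assumption:
# "83k" is treated as 83,000, and so on; exact counts will differ slightly).
SPLITS = {
    "train": {"images": 83_000, "questions": 444_000},
    "validation": {"images": 41_000, "questions": 214_000},
    "test": {"images": 81_000, "questions": 448_000},
}


def questions_per_image(split: str) -> float:
    """Average number of questions per image in the given split."""
    sizes = SPLITS[split]
    return sizes["questions"] / sizes["images"]


def totals() -> dict:
    """Sum the image and question counts over all three splits."""
    return {
        key: sum(sizes[key] for sizes in SPLITS.values())
        for key in ("images", "questions")
    }


if __name__ == "__main__":
    for name in SPLITS:
        print(f"{name}: {questions_per_image(name):.2f} questions per image")
    print("totals:", totals())
```

With these rounded figures, each split averages a little over five questions per image, and the three splits together cover roughly 205k images and 1.1M questions.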

Cite this as

Prajjwal Bhargava (2024). Dataset: VQA 2.0. https://doi.org/10.57702/5f5fx0ji

DOI retrieved: November 25, 2024

Additional Info

Field         Value
Created       November 25, 2024
Last update   December 2, 2024
Defined In    https://doi.org/10.48550/arXiv.2402.12846
Citation      https://doi.org/10.48550/arXiv.1806.00857
Author        Prajjwal Bhargava
Homepage      https://vqa.org/