VQA 2.0

doi:doi:10.57702/5f5fx0ji

VQA 2.0

The VQA 2.0 dataset is used for visual question answering task. It consists of three sets with a train set containing 83k images and 444k questions, a validation set containing 41k images and 214k questions, and a test set containing 81k images and 448k questions.

Data and Resources

Original MetadataJSON
The json representation of the dataset with its distributions based on DCAT.
Explore
- Preview
- Download

Cite this as

Prajjwal Bhargava (2024). Dataset: VQA 2.0. https://doi.org/10.57702/5f5fx0ji

DOI retrieved: November 25, 2024

Additional Info

Field	Value
Created	November 25, 2024
Last update	December 2, 2024
Defined In	https://doi.org/10.48550/arXiv.2402.12846
Citation	https://doi.org/10.48550/arXiv.1806.00857
Author	Prajjwal Bhargava
Homepage	https://vqa.org/