VQAv2

Visual Question Answering (VQA) has achieved great success thanks to the rapid development of deep neural networks (DNNs). Data augmentation, one of the major techniques for training DNNs, has been widely used in many computer vision tasks.

Data and Resources

This dataset has no data

Cite this as

Yen-Chun Chen, Linjie Li, Licheng Yu, Ahmed El Kholy, Faisal Ahmed, Zhe Gan, Yu Cheng, Jingjing Liu (2024). Dataset: VQAv2. https://doi.org/10.57702/kob379ex

Private DOI: This DOI is not yet resolvable. It is available for use in manuscripts and will be published when the dataset is made public.

Additional Info

Field: Value
Created: December 2, 2024
Last update: December 2, 2024
Defined In: https://doi.org/10.48550/arXiv.2305.10722
Citation:
  • https://doi.org/10.48550/arXiv.2007.09592
  • https://doi.org/10.48550/arXiv.2402.08756
  • https://doi.org/10.48550/arXiv.2402.08360
Author: Yen-Chun Chen
More Authors: Linjie Li, Licheng Yu, Ahmed El Kholy, Faisal Ahmed, Zhe Gan, Yu Cheng, Jingjing Liu
Homepage: https://huggingface.co/datasets/vqa2