VQA

The VQA dataset is a large-scale visual question answering dataset that consists of pairs of images that require natural language answers.

BibTex: