Image Captioning and Visual Question Answering

The dataset is used for image captioning and visual question answering.

Data and Resources

Cite this as

Peter Anderson, Xiaodong He, Chris Buehler, Damien Teney, Mark Johnson, Stephen Gould, Lei Zhang (2024). Dataset: Image Captioning and Visual Question Answering. https://doi.org/10.57702/96mwaviz

DOI retrieved: December 17, 2024

Additional Info

Field Value
Created December 17, 2024
Last update December 17, 2024
Defined In https://doi.org/10.48550/arXiv.2108.07140
Author Peter Anderson
More Authors
Xiaodong He
Chris Buehler
Damien Teney
Mark Johnson
Stephen Gould
Lei Zhang