Image Captioning and Visual Question Answering

doi:doi:10.57702/96mwaviz

Image Captioning and Visual Question Answering

The dataset is used for image captioning and visual question answering.

Data and Resources

Original MetadataJSON
The json representation of the dataset with its distributions based on DCAT.
Explore
- Preview
- Download

Cite this as

Peter Anderson, Xiaodong He, Chris Buehler, Damien Teney, Mark Johnson, Stephen Gould, Lei Zhang (2024). Dataset: Image Captioning and Visual Question Answering. https://doi.org/10.57702/96mwaviz

DOI retrieved: December 17, 2024

Additional Info

Field	Value
Created	December 17, 2024
Last update	December 17, 2024
Defined In	https://doi.org/10.48550/arXiv.2108.07140
Author	Peter Anderson
More Authors	Xiaodong He Chris Buehler Damien Teney Mark Johnson Stephen Gould Lei Zhang