Cite this as

Peter Anderson, Xiaodong He, Chris Buehler, Damien Teney, Mark Johnson, Stephen Gould, Lei Zhang (2024). Dataset: Image Captioning and Visual Question Answering. Resource: Original Metadata. https://doi.org/10.57702/96mwaviz

DOI retrieved: December 17, 2024

Additional Information

Created: December 17, 2024
Last updated: December 17, 2024
Format: JSON