TextCaps: A dataset for image captioning with reading comprehension

TextCaps: A dataset for image captioning with reading comprehension.

Data and Resources

Cite this as

Oleksii Sidorov, Ronghang Hu, Marcus Rohrbach, Amanpreet Singh (2024). Dataset: TextCaps: A dataset for image captioning with reading comprehension. https://doi.org/10.57702/nekr98ye

DOI retrieved: December 16, 2024

Additional Info

Field Value
Created December 16, 2024
Last update December 16, 2024
Defined In https://doi.org/10.48550/arXiv.2105.03236
Author Oleksii Sidorov
More Authors
Ronghang Hu
Marcus Rohrbach
Amanpreet Singh
Homepage https://textvqa.org/textcaps