Visual Commonsense Reasoning (VCR)

VCR consists of 290k multiple-choice questions derived from 110k movie scenes, focusing on visual commonsense reasoning.

Cite this as

Rowan Zellers, Yonatan Bisk, Ali Farhadi, Yejin Choi (2024). Dataset: Visual Commonsense Reasoning (VCR). https://doi.org/10.57702/dghte7ch

DOI retrieved: November 25, 2024

Additional Info

Field: Value
Created: November 25, 2024
Last update: November 25, 2024
Defined In: https://doi.org/10.48550/arXiv.1908.03557
Author: Rowan Zellers
More Authors: Yonatan Bisk, Ali Farhadi, Yejin Choi