Visual Spatial Reasoning

Visual Spatial Reasoning (VSR) is a controlled probing dataset for testing vision-language models' capabilities of recognizing and reasoning about spatial relations in natural image-text pairs.

Data and Resources

Cite this as

Fangyu Liu, Guy Emerson, Nigel Collier, Nigel Collier (2024). Dataset: Visual Spatial Reasoning. https://doi.org/10.57702/12wc0nfm

DOI retrieved: December 3, 2024

Additional Info

Field Value
Created December 3, 2024
Last update December 3, 2024
Defined In https://doi.org/10.48550/arXiv.2205.00363
Author Fangyu Liu
More Authors
Guy Emerson
Nigel Collier
Nigel Collier
Homepage https://github.com/cambridgeltl/visual-spatial-reasoning