A diagnostic VQA dataset based on abstract objects that enables a faster and less biased evaluation of spatial reasoning behavior in VQA compared with the original GRiD-3D dataset.
A comprehensive, simpliļ¬ed diagnostic VQA dataset with abstract objects that shows similar behavior to the original GRiD-3D dataset when learned by the two established VQA...
SPARE3D is a dataset for spatial reasoning on three-view line drawings. It contains five spatial reasoning tasks in three categories of increasing difficulty.
Visual Spatial Reasoning (VSR) is a controlled probing dataset for testing vision-language models' capabilities of recognizing and reasoning about spatial relations in natural...