-
ChartCheck
ChartCheck is a novel, large-scale dataset for explainable fact-checking against real-world charts, consisting of 1.7k charts and 10.5k human-written claims and explanations. -
Synthetic Visual Reasoning Test (SVRT)
The Synthetic Visual Reasoning Test (SVRT) dataset consists of 23 binary classification tasks, each defined by a particular configuration of relations. The tasks can be broadly... -
CLEVR-Robot Environment
A benchmark for evaluating task compositionality and long-horizon tasks through object manipulation, with language serving as the mechanism for goal specification.