-
CLEVR-Humans
The CLEVR-Humans dataset consists of 32,164 questions asked by humans, containing words and reasoning steps that were unseen in CLEVR. -
Fashionpedia-Taste
Fashionpedia-Taste is an explainable fashion taste dataset that challenges computer vision systems to predict whether a subject like a fashion image and provide explanations... -
Mutual: A dataset for multi-turn dialogue reasoning
A dataset for multi-turn dialogue reasoning.