The dataset used in the paper is not explicitly described, but it is mentioned that the authors used MIT-States, Fashion200k, and COCO 2017 Panoptic Segmentation datasets for...
CLEVR images contain objects characterized by a set of attributes (shape, color, size and material). The questions are grouped into 5 categories: Exist, Count, CompareInteger,...