The GQA dataset is a visual question answering dataset that characterizes in compositional question answering and visual reasoning about real-world images.
CLEVR images contain objects characterized by a set of attributes (shape, color, size and material). The questions are grouped into 5 categories: Exist, Count, CompareInteger,...