The VQA-CP dataset is a split of the VQA dataset, designed to test generalization skills across changes in the answer distribution between the training and the test sets.
The GQA dataset is a visual question answering dataset that characterizes in compositional question answering and visual reasoning about real-world images.