NLVR2 and OKVQA-S
NLVR2 is a challenging VQA dataset that requires the model to compare, locate, and count objects based on the given question and images. OKVQA-S is a challenging category of OKVQA that requires the model to use external knowledge, such as common sense, and facts, to answer questions about different topics and scenarios.
BibTex: