-
Abstraction and Reasoning Corpus (ARC)
A collection of heterogeneous visual reasoning data sets and an interesting benchmark for two reasons: First, visual reasoning programs tend to be large (in current program... -
Cora and Citeseer datasets
The Cora and Citeseer datasets are used for training machine learning models to classify documents into different categories. -
NLVR2 and OKVQA-S
NLVR2 is a challenging VQA dataset that requires the model to compare, locate, and count objects based on the given question and images. OKVQA-S is a challenging category of... -
Mixture of Rationales (MoR) for Visual Question Answering
Zero-shot visual question answering (VQA) is a challenging task that requires reasoning across modalities. While some existing methods rely on a single rationale within the... -
VQA-Introspect and VQAv2
A dataset for the Visual Question Answering (VQA) task that combines the VQA-Introspect and VQAv2 datasets. -
Quora Question Pairs
The Quora Question Pairs dataset contains 404k English question pairs from Quora, created to test a model's ability to understand the semantics of text and determine... -
SmartonAI dataset
The dataset used in the paper is a collection of user queries and corresponding responses generated by the SmartonAI plugin. -
LaMini: A Large-Scale Instruction Dataset
The LaMini approach involves generating a large-scale instruction dataset by leveraging the outputs of a large language model, gpt-3.5-turbo. -
SQuAD 2.0 and IMDB
The paper does not explicitly describe its dataset, but the authors mention using the SQuAD 2.0 dataset for question answering and the IMDB dataset for Movie... -
Quora dataset for question classification
Quora dataset for question classification -
TREC dataset for question classification
TREC dataset for question classification