-
Quora Question Pairs
The Quora Question Pairs dataset contains 404k English question pairs on Quora, created to test the abilities of the models to understand the semantics from text, and determine... -
DocVQA and ChartQA Datasets
The dataset used for testing the Vary-base model, containing DocVQA and ChartQA datasets. -
Visual Genome
The Visual Genome dataset is a large-scale visual question answering dataset, containing 1.5 million images, each with 15-30 annotated entities, attributes, and relationships. -
SimpleQuestion dataset for adaptive learning
The dataset used in this paper is a collection of questions and answers related to adaptive learning and generative AI. -
SimpleQuestion dataset for Wikidata
The dataset used in this paper is a reinforcement learning dataset, specifically the SimpleQuestion dataset, which contains questions answerable using Wikidata as the knowledge... -
WikiTableQuestions
Semantic parsing maps a user-issued natural language (NL) utterance to a machine-executable meaning representation (MR), such as λ−calculus (Zettlemoyer and Collins, 2005), SQL... -
AlpacaFarm
The AlpacaFarm dataset is a large-scale dataset for preference optimization, which consists of a set of instructions and their corresponding responses. -
EntailmentBank
The dataset used in the paper to evaluate the REFLEX system, consisting of multiple-choice questions with entailment relationships.