-
LLM dataset
The dataset used in this paper is not explicitly described, but it is mentioned that it is a large language model (LLM) and that the authors used it to train and evaluate their... -
SQuAD 1.1 and SQuAD 2
The SQuAD 1.1 and SQuAD 2 datasets are used to evaluate the performance of the EQuANt model. -
BIG-Bench Hard
The BIG-Bench Hard dataset is derived from the original BIG-Bench evaluation suite, focusing on tasks that pose challenges to existing language models. -
Leveraging QA Datasets to Improve Generative Data Augmentation
The paper proposes a method to leverage QA datasets for training generative language models to be context generators for a given question and answer. -
LDC2015E86
LDC2015E86 is a dataset of abstract meaning representation (AMR) annotations for English. -
SQuAD: 100,000+ Questions for Machine Comprehension of Text
The SQuAD dataset is a benchmark for natural language understanding tasks, including question answering and text classification. -
FarFetched: Entity-centric Reasoning and Claim Validation for the Greek Language
FarFetched is a modular framework that enables people to verify any kind of textual claim based on the incorporated evidence from textual news sources. -
InsuranceQA
The InsuranceQA dataset is a question answering dataset containing questions and answers. -
QQP Dataset
The QQP dataset contains more than 400k question pairs. -
Semantics in Question Answering
Semanitic parsing on freebase from question-answer pairs -
RadQA: A question answering dataset to improve comprehension of radiology rep...
RadQA: A question answering dataset to improve comprehension of radiology reports -
Natural Questions
The Natural Questions dataset consists of questions extracted from web queries, with each question accompanied by a corresponding Wikipedia article containing the answer. -
MS MARCO: A Human-Generated Machine Reading Comprehension Dataset
The dataset is used for training and evaluating the MS MARCO model, a question answering model.