-
TREC dataset
The dataset used in the paper is the TREC dataset, which consists of 124 queries. -
SQuAD: 100,000+ Questions for Machine Comprehension of Text
The SQuAD dataset is a benchmark for natural language understanding tasks, including question answering and text classification. -
FarFetched: Entity-centric Reasoning and Claim Validation for the Greek Language
FarFetched is a modular framework that enables people to verify any kind of textual claim based on the incorporated evidence from textual news sources. -
CounterFact
The dataset used in the paper is a collection of irrelevant questions that are more challenging than the ones in existing datasets. -
bAbI Question Answering dataset
The bAbI Question Answering dataset is a benchmark for evaluating the ability of RNNs to answer questions. -
Brilla AI Dataset
A dataset of NSMQ contests from 2012-2022 containing videos of the contest and corresponding metadata, text form of riddles questions, and open-source science textbooks. -
WebQuestions
The task of Question Answering over Linked Data (QALD) has received increased attention over the last years (see the surveys [14] and [36]). The task consists in mapping natural... -
InsuranceQA
The InsuranceQA dataset is a question answering dataset containing questions and answers. -
Conversational dataset
The conversational dataset is used to evaluate the performance of the proposed algorithms. The dataset consists of 20,000 questions and answers, where each question is answered... -
TruthfulQA
The TruthfulQA dataset is a dataset that contains 817 questions designed to evaluate language models' preference to mimic some human falsehoods. -
SNLI dataset
The dataset used in the paper is the SNLI dataset. -
Question Answering Based Clinical Text Structuring
Clinical text structuring is a critical and fundamental task for clinical research. Traditional methods such as task-specific end-to-end models and pipeline models usually... -
QQP Dataset
The QQP dataset contains more than 400k question pairs.