-
IcliniqPro
IcliniqPro is a dataset for medical dialogue, derived from iCliniq. -
Towards Trustworthy AutoGrading of Short, Multi-lingual, Multi-type Answers
The dataset consists of approximately 10 million question-answer pairs from multiple languages covering diverse fields such as math and language, and strong variation in... -
Learnings from Data Integration for Augmented Language Models
The dataset used in the paper is a collection of questions and answers for augmented language models. -
bAbI Movie Dialog dataset
The bAbI Movie Dialog dataset is a synthetic dataset containing elementary tasks such as selecting an answer between one or more candidate facts, answering yes/no questions,... -
AstroMLab 1: Who Wins Astronomy Jeopardy!?
A comprehensive evaluation of proprietary and open-weights large language models using the first astronomy-specific benchmarking dataset. -
ASQA and ELI5 datasets
The ASQA and ELI5 datasets are used for evaluating the performance of large language models on the task of positional fine-grained citation generation. -
Expected Policy Gradients
The authors used the SimpleQuestion dataset as a test environment for their expected policy gradient method. -
NaturalQuestions-Open WebQuestions CuratedTREC
Open-domain QA datasets for testing the proposed method -
SQuAD: 100,000+ Questions for Machine Comprehension of Text
The SQuAD dataset is a benchmark for natural language understanding tasks, including question answering and text classification. -
ChroniclingAmericaQA
ChroniclingAmericaQA is a large-scale question-answering dataset comprising 487k question-answer pairs over a collection of historical American newspapers with the objective of... -
Propositional Horn Clause Logic Dataset
The dataset used in this paper is a collection of questions and answers related to logic programming, with a focus on propositional Horn clause logic. -
Room-to-Room (R2R) dataset
The Room-to-Room (R2R) dataset is a benchmark for vision-and-language navigation tasks. It consists of 7,189 paths sampled from its navigation graphs, each with three... -
WebQuestions
The task of Question Answering over Linked Data (QALD) has received increased attention over the last years (see the surveys [14] and [36]). The task consists in mapping natural... -
InsuranceQA
The InsuranceQA dataset is a question answering dataset containing questions and answers.