Dataset - LDM

IcliniqPro

IcliniqPro is a dataset for medical dialogue, derived from iCliniq.
- Dataset
- JSON
PubMedPro

PubMedPro is a dataset for biomedical research question answering, constructed from academic QA scenarios.
- Dataset
- JSON
Towards Trustworthy AutoGrading of Short, Multi-lingual, Multi-type Answers

The dataset consists of approximately 10 million question-answer pairs in multiple languages, covering diverse fields such as math and language, and strong variation in...
- Dataset
- JSON
Learnings from Data Integration for Augmented Language Models

The dataset used in the paper is a collection of questions and answers for augmented language models.
- Dataset
- JSON
bAbI Movie Dialog dataset

The bAbI Movie Dialog dataset is a synthetic dataset containing elementary tasks such as selecting an answer between one or more candidate facts, answering yes/no questions,...
- Dataset
- JSON
AstroMLab 1: Who Wins Astronomy Jeopardy!?

A comprehensive evaluation of proprietary and open-weights large language models using the first astronomy-specific benchmarking dataset.
- Dataset
- JSON
QALD

The QALD dataset is a benchmarking dataset for question answering systems over linked data.
- Dataset
- JSON
LC-QuAD

The LC-QuAD dataset is a gold standard complex question answering benchmark over the DBpedia 2016-04 release.
- Dataset
- JSON
ReCoRD

The ReCoRD dataset is a benchmark for reading comprehension tasks.
- Dataset
- JSON
ASQA and ELI5 datasets

The ASQA and ELI5 datasets are used for evaluating the performance of large language models on the task of positional fine-grained citation generation.
- Dataset
- JSON
Alpaca-2

The dataset contains questions answerable using Wikidata as the knowledge graph, focusing on questions with a single entity and relation.
- Dataset
- JSON
Expected Policy Gradients

The authors used the SimpleQuestion dataset as a test environment for their expected policy gradient method.
- Dataset
- JSON
NaturalQuestions-Open, WebQuestions, CuratedTREC

Open-domain QA datasets used to test the proposed method.
- Dataset
- JSON
SQuAD: 100,000+ Questions for Machine Comprehension of Text

The SQuAD dataset is a reading comprehension benchmark of 100,000+ questions posed on Wikipedia articles, where the answer to each question is a span of text from the corresponding passage.
- Dataset
- JSON
ChroniclingAmericaQA

ChroniclingAmericaQA is a large-scale question-answering dataset comprising 487k question-answer pairs over a collection of historical American newspapers with the objective of...
- Dataset
- JSON
Propositional Horn Clause Logic Dataset

The dataset used in this paper is a collection of questions and answers related to logic programming, with a focus on propositional Horn clause logic.
- Dataset
- JSON
Room-to-Room (R2R) dataset

The Room-to-Room (R2R) dataset is a benchmark for vision-and-language navigation tasks. It consists of 7,189 paths sampled from its navigation graphs, each with three...
- Dataset
- JSON
WebQuestions

The task of Question Answering over Linked Data (QALD) has received increased attention in recent years (see the surveys [14] and [36]). The task consists of mapping natural...
- Dataset
- JSON
InsuranceQA

The InsuranceQA dataset is a question answering dataset containing questions and answers.
- Dataset
- JSON
LLaMA Pro

LLaMA Pro: Progressive LLaMA with Block Expansion
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

196 datasets found