Dataset - LDM

What did you mention? A large scale mention detection benchmark for spoken an...

A large-scale mention detection benchmark for spoken and written text.
- Dataset
- JSON
NeuralQA

A usable library for question answering on large datasets
- Dataset
- JSON
FEVER: A Large-Scale Dataset for Fact Extraction and Verification

The FEVER dataset consists of 185,455 annotated claims, together with 5,416,537 Wikipedia documents containing roughly 25 million sentences as potential evidence.
- Dataset
- JSON
BEIR

The BEIR dataset is a large-scale zero-shot evaluation dataset for information retrieval models, consisting of 13,000 documents and 1,000 questions.
- Dataset
- JSON
TREC Deep Learning track

The TREC Deep Learning track dataset is a collection of question answering datasets, which are used for passage retrieval and ranking.
- Dataset
- JSON
SHACL Satisﬁability and Containment

The Shapes Constraint Language (SHACL) is a recent W3C recommendation language for validating RDF data. This paper provides a translation of SHACL into a new first-order...
- Dataset
- JSON
Good Judgment Open

The Good Judgment Open (GJO) dataset contains 1770 datapoints (698 'forecasts' and 1072 'comments') posted by 242 anonymised users with a range of expertise.
- Dataset
- JSON
QQP Dataset

The QQP dataset contains more than 400k question pairs.
- Dataset
- JSON
Self-Recognition in Language Models

A self-recognition test for language models using model-generated security questions.
- Dataset
- JSON
StockQA

A large-scale dataset containing over 180K StockQA instances, built based on Chinese online stock forums.
- Dataset
- JSON
Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autono...

A driving scenario QA task and a dataset of 160k QA pairs derived from 10k driving scenarios, paired with high quality control commands collected with RL agent and question...
- Dataset
- JSON
MT-bench

The dataset used in the paper is MT-bench, which is an LLM-based automated evaluation metric comprising 80 challenging questions.
- Dataset
- JSON
YAGO3-10

Knowledge graphs are composed of different elements: entity nodes, relation edges, and literal nodes. Each literal node contains an entity’s attribute value (e.g. the height of...
- Dataset
- JSON
QQP

The Quora Question Pairs (QQP) dataset consists of 50,000 question pairs labeled with paraphrase or non-paraphrase.
- Dataset
- JSON
Vicuna dataset

Diffusion-based language models are emerg-ing as a promising alternative to autoregressive LMs: they approach the competence of autoregressive LMs while offering nuanced...
- Dataset
- JSON
DOLLY dataset

Diffusion-based language models are emerg-ing as a promising alternative to autoregressive LMs: they approach the competence of autoregressive LMs while offering nuanced...
- Dataset
- JSON
NYT and WebNLG

NYT and WebNLG are widely used datasets for relational triple extraction.
- Dataset
- JSON
Repair Program

The repair program Π(D, IC) for a database instance D without nulls has the following rules: Program facts: P(¯a) for each atom P(¯a) ∈ D. For a constraint of the form...
- Dataset
- JSON
Repair Programs for Consistent Query Answering

Repair programs for consistent query answering have been well studied in the literature. They specify the database repairs as their stable models. On their basis, and using...
- Dataset
- JSON
AutoCast++: Enhancing World Event Prediction with Zero-Shot Ranking-Based Con...

The Autocast++ dataset is a benchmark for event forecasting using news articles.
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

416 datasets found