Dataset - LDM

HallusionBench

HallusionBench is an advanced diagnostic suite for entangled language hallucination and visual illusion in large vision-language models.
- Dataset
- JSON
VALOR-BENCH

VALOR-BENCH is a comprehensive human-annotated dataset covering hallucinations in large vision-language models, with a focus on measuring hallucinations in generative tasks.
- Dataset
- JSON
TruthfulQA

The TruthfulQA dataset is a dataset that contains 817 questions designed to evaluate language models' preference to mimic some human falsehoods.
- Dataset
- JSON
Detecting Hallucinated Content in Conditional Neural Sequence Generation

Neural sequence models can generate highly fluent sentences, but recent studies have also shown that they are also prone to hallucinate additional content not supported by the...
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

4 datasets found