Dataset - LDM

Simulated Medical Diagnosis Task

The dataset used in the paper is a simulated medical diagnosis task in which patients can lie about their symptoms and the goal is to predict pregnancy based on self-reported...
- Dataset
- JSON
TruthfulQA

TruthfulQA contains 817 questions designed to evaluate language models' tendency to mimic human falsehoods.
- Dataset
- JSON
TruthX: Alleviating Hallucinations by Editing Large Language Models

- Dataset
- JSON

You can also access this registry using the API (see API Docs).

3 datasets found