Factuality Evaluation - Groups - LDM

Factcheck-GPT

Factcheck-GPT is an end-to-end fine-grained document-level fact-checking and correction of LLM output.
- Dataset
- JSON
FEVER 2.0 dataset

The FEVER 2.0 dataset is a collection of claims and evidence sentences for factuality evaluation.
- Dataset
- JSON
FACTOR

The dataset used in this paper is FACTOR, a benchmark for factuality evaluation of language models.
- Dataset
- JSON

Before browse our site, please accept our cookies policy