HaluEval-Sum

The dataset used in this paper is HaluEval-Sum, a large-scale hallucination evaluation benchmark for large language models.

Data and Resources

Cite this as

Jushi Kai Hai Hu Zhouhan Lin (2024). Dataset: HaluEval-Sum. https://doi.org/10.57702/os6l406d

DOI retrieved: December 16, 2024

Additional Info

Field Value
Created December 16, 2024
Last update December 16, 2024
Defined In https://doi.org/10.48550/arXiv.2401.05930
Author Jushi Kai Hai Hu Zhouhan Lin
Homepage https://github.com/tatsu-lab/stanford_alpaca