HaluEval-Sum
The dataset used in this paper is HaluEval-Sum, a large-scale hallucination evaluation benchmark for large language models.
Tags: Evaluation
Format: JSON