FACTOR

The dataset used in this paper is FACTOR, a benchmark for factuality evaluation of language models.

BibTeX: