3 datasets found
Groups: Factuality Evaluation
Organizations: No Organization

Factcheck-GPT
Factcheck-GPT provides end-to-end, fine-grained, document-level fact-checking and correction of LLM output.
Dataset (JSON)

FEVER 2.0 dataset
The FEVER 2.0 dataset is a collection of claims and evidence sentences for factuality evaluation.
Dataset (JSON)

FACTOR
FACTOR is a benchmark for factuality evaluation of language models.
Dataset (JSON)