1 dataset found

Groups: Truthfulness Tags: language model evaluation

Filter Results
  • TruthfulQA

    The TruthfulQA dataset is a dataset that contains 817 questions designed to evaluate language models' preference to mimic some human falsehoods.
You can also access this registry using the API (see API Docs).