
Formats: JSON
Tags: Alpaca Eval 2

  • Alpaca Eval 2

    Alpaca Eval 2, the dataset used in the paper, is an automated evaluation benchmark that measures how well LLM outputs align with human preferences.
You can also access this registry using the API (see API Docs).