2 datasets found

Tags: Language Model Alignment

  • Anthropic’s Helpfulness and Harmlessness

    Anthropic’s Helpfulness and Harmlessness datasets are used for preference optimization. They consist of a set of instructions and their corresponding responses.
  • AlpacaFarm

    AlpacaFarm is a large-scale dataset for preference optimization. It consists of a set of instructions and their corresponding responses.
You can also access this registry using the API (see API Docs).