1 dataset found

Tags: HH-RLHF

Filter Results
  • HH-RLHF

    The HH-RLHF dataset is a human preference dataset for reinforcement learning from human feedback.
You can also access this registry using the API (see API Docs).