1 dataset found

Groups: Language Models Organizations: No Organization Formats: JSON

Filter Results
  • HH-RLHF

    The HH-RLHF dataset is a human preference dataset for reinforcement learning from human feedback.