Datasets Activity Stream About Order by Relevance Name Ascending Name Descending Last Modified Go 1 dataset found Groups: Language Models Formats: JSON Filter Results HH-RLHF The HH-RLHF dataset is a human preference dataset for reinforcement learning from human feedback. Dataset JSON