Datasets Activity Stream About Order by Relevance Name Ascending Name Descending Last Modified Go 1 dataset found Groups: Language Models Filter Results HH-RLHF The HH-RLHF dataset is a human preference dataset for reinforcement learning from human feedback. Dataset JSON