Activity Stream
-
admin updated the dataset Anthropic Helpfulness Base eval
2 days ago | View this version | Changes -
admin created the dataset Anthropic Helpfulness Base eval
2 days ago | View this version -
admin updated the dataset Anthropic Helpfulness Base
2 days ago | View this version | Changes -
admin created the dataset Anthropic Helpfulness Base
2 days ago | View this version -
admin updated the dataset Multimodal Query Suggestion with Multi-Agent Reinforcement Learning from Human Feedback
2 days ago | View this version | Changes -
admin created the dataset Multimodal Query Suggestion with Multi-Agent Reinforcement Learning from Human Feedback
2 days ago | View this version -
admin updated the dataset HIVE: Harnessing Human Feedback for Instructional Visual Editing
3 days ago | View this version | Changes -
admin created the dataset HIVE: Harnessing Human Feedback for Instructional Visual Editing
3 days ago | View this version -
admin updated the dataset Differences in Fairness Preferences
3 days ago | View this version | Changes -
admin created the dataset Differences in Fairness Preferences
3 days ago | View this version -
admin updated the dataset Anthropic HH dataset
3 days ago | View this version | Changes -
admin created the dataset Anthropic HH dataset
3 days ago | View this version -
admin updated the dataset Training a helpful and harmless assistant with reinforcement learning from human feedback
2 weeks ago | View this version | Changes -
admin created the dataset Training a helpful and harmless assistant with reinforcement learning from human feedback
2 weeks ago | View this version -
admin updated the dataset SHP dataset
2 weeks ago | View this version | Changes -
admin created the dataset SHP dataset
2 weeks ago | View this version -
admin updated the dataset HH-RLHF dataset
2 weeks ago | View this version | Changes -
admin created the dataset HH-RLHF dataset
2 weeks ago | View this version -
admin updated the dataset Toxic-DPO Dataset
2 weeks ago | View this version | Changes -
admin created the dataset Toxic-DPO Dataset
2 weeks ago | View this version -
admin updated the dataset Anthropic-HH-RLHF Dataset
2 weeks ago | View this version | Changes -
admin created the dataset Anthropic-HH-RLHF Dataset
2 weeks ago | View this version -
admin updated the dataset UltraRM-13B
2 weeks ago | View this version | Changes -
admin created the dataset UltraRM-13B
2 weeks ago | View this version -
admin updated the dataset AlpacaFarm
2 weeks ago | View this version | Changes -
admin created the dataset AlpacaFarm
2 weeks ago | View this version -
admin updated the dataset Anthropic-HH
2 weeks ago | View this version | Changes -
admin created the dataset Anthropic-HH
2 weeks ago | View this version -
admin created the group Human Feedback
2 weeks ago