Activity Stream
-
admin updated the dataset Training a helpful and harmless assistant with reinforcement learning from human feedback
2 weeks ago | View this version | Changes -
admin created the dataset Training a helpful and harmless assistant with reinforcement learning from human feedback
2 weeks ago | View this version