1 dataset found

Tags: Reward Regularization

Filter Results
  • REBEL

    REBEL is a dataset for reward regularization based robotic reinforcement learning from human feedback.
You can also access this registry using the API (see API Docs).