1 dataset found

Tags: Reward Function

Filter Results
  • Interactive Scoring IRL

    The dataset used in the paper is a set of trajectories and scores provided by human teachers to train a behavioral policy in a sparse reward environment.
You can also access this registry using the API (see API Docs).