Dataset - LDM

LunarLander environment

The dataset used in the paper is the LunarLander environment, which is a classic control problem. The agent learns to land a lunar lander using human feedback.
- Dataset
- JSON
CartPole, Pendulum, and LunarLander

The dataset used in the paper is a set of environments for reinforcement learning, including CartPole, Pendulum, and LunarLander.
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

2 datasets found