-
MuJoCo Environment
The dataset used in the paper is a MuJoCo environment, with 13-states and 4-control inputs, nonlinear dynamics with polynomial dependency in the control inputs. -
Unitree A1
The dataset used in the paper for safe reinforcement learning for legged locomotion. -
Unitree Laikago
The dataset used in the paper for safe reinforcement learning for legged locomotion. -
Guard: A safe reinforcement learning benchmark
The dataset used in the paper is a collection of robot locomotion tasks with various constraints. -
HandManipulateBlock
The HandManipulateBlock environment from OpenAI gym robotics suite -
FetchPickAndPlace and HandManipulateBlock
The FetchPickAndPlace and HandManipulateBlock environments from OpenAI gym robotics suite -
FetchPush, FetchPickAndPlace and HandManipulateBlock
The FetchPush, FetchPickAndPlace and HandManipulateBlock environments from OpenAI gym robotics suite -
Funnel board
The Funnel board task is a domain where a ball falls through a grid of obstacles onto one of five platforms. Every other row of obstacles consists of funnel-shaped objects,... -
Room runner
The Room runner task is a domain where an agent moves through a randomly generated map of rooms, which are observed in 2D from above. The agent follows the policy of always... -
Event Camera-based Reinforcement Learning
The dataset used in the paper is a simulated environment for event camera-based reinforcement learning. The dataset includes a car-like robot equipped with an event camera, and... -
Multiple-confounded-Mujoco-Envs
The dataset used in the paper is a collection of environments with multiple confounders, including mass, length, damping, and a crippled leg. The dataset is used to evaluate the... -
Simulated Robot Environment
The dataset used in the paper is a simulated robot environment. -
Simulated Self-Driving Car Environment
The dataset used in the paper is a simulated self-driving car environment, a mountain car environment, and a robotic environment. -
Theremin Dataset
The dataset used for training a robotic agent to play the theremin instrument using time-dependent goals. -
Motor Babbling Data
The dataset used in the paper is a set of motor babbling data, which is used to initialize the dynamics model and to optimize the control policies. -
Continual World
The Continual World benchmark consists of ten realistic robotic manipulation tasks.