-
CartPole dataset
The dataset used in the paper is a high-dimensional state space of the CartPole environment, artificially expanded to 500 dimensions using a union of five different nonlinear... -
Time Limits in Reinforcement Learning
The dataset used in the paper is a reinforcement learning dataset, specifically for time-limited tasks and time-unlimited tasks. -
Multi-Agent Games Using Adaptive Feedback Control
The dataset used in this paper is a set of five game-theoretic tasks (Harmony Game, Hawk-Dove, Stag-Hunt, Prisoners Dilemma and Battle of the Exes) with seven different agent... -
Desk Cleanup environment
The Desk Cleanup environment, which consists of a robotic arm and several blocks on a desk. -
Fetch Manipulation environment with 3 blocks
The Fetch Manipulation environment with 3 blocks, which consists of a robotic arm with a gripper and three blocks. -
Fetch Manipulation environment with 2 blocks
The Fetch Manipulation environment with 2 blocks, which consists of a robotic arm with a gripper and two blocks. -
Fetch Manipulation environment
The Fetch Manipulation environment built on top of Mujoco, which consists of a robotic arm with a gripper and square blocks. -
Breakout, SeaQuest, Space Invaders, and Qbert Environments
The dataset used in this work is the Breakout, SeaQuest, Space Invaders, and Qbert environments. -
Automatic Curricula via Expert Demonstrations (ACED)
ACED constructs a curriculum by sampling states from expert demonstration trajectories as initializations for each training episode, where the samples initially come from near... -
OpenAI Gym Reacher Environment
The dataset used in the paper is a set of data collected from the OpenAI Gym Reacher environment. -
ATOMICS-2 dataset
The dataset used for testing the reinforcement learning agent for medical alarm annotation. -
ATOMICS dataset
The dataset used for training and testing the reinforcement learning agent for medical alarm annotation. -
OmniSafe: An Infrastructure for Accelerating Safe Reinforcement Learning Rese...
OmniSafe is a comprehensive infrastructural framework designed to accelerate Safe Reinforcement Learning research.