-
Mujoco control tasks
The authors used the Mujoco control tasks, including Ant-v2, HalfCheetah-v2, Hopper-v2, and Walker2d-v2. -
Cart-pole problem dataset
The dataset used for the cart-pole problem is a finite set of states: S, a finite set of actions: A, a state transition probability matrix, P, a reward function R, and a... -
OpenAI Gym and Atari games
The dataset used in the paper is not explicitly described, but it is mentioned that the authors conducted experiments on several representative tasks from the OpenAI Gym and... -
Continual World
The Continual World benchmark consists of ten realistic robotic manipulation tasks. -
Open Bandit Dataset and Pipeline
Open bandit dataset and pipeline: Towards realistic and reproducible off-policy evaluation -
Uncertainty-Aware Model-Based Reinforcement Learning with Application to Auto...
The proposed uncertainty-aware model-based reinforcement learning framework is applied to end-to-end autonomous driving tasks. -
Bootstrapped DQN
The Bootstrapped DQN dataset is a collection of 49 Atari games. -
Incentivizing Exploration in Atari
The Incentivizing Exploration in Atari dataset is a collection of 49 Atari games. -
Arcade Learning Environment
The Arcade Learning Environment (ALE) dataset is a collection of 49 Atari games. -
Rainbow dataset
The dataset used in the paper is the Rainbow dataset, which is a combination of six extensions to the DQN algorithm. -
Dynamic Frame Skip Deep Q-Network (DFDQN) dataset
The dataset used in the paper is the Dynamic Frame Skip Deep Q-Network (DFDQN) dataset, which consists of 3 Atari games: Seaquest, Space Invaders, and Alien. -
Deep Q-Network (DQN) dataset
The dataset used in the paper is the Deep Q-Network (DQN) dataset, which consists of 15 classic Atari games. -
Proximal Policy Optimization (PPO) dataset
The dataset used in the paper is the Proximal Policy Optimization (PPO) dataset, which consists of 10 different Atari environments. -
OpenAI Gym dataset
The dataset used in the paper is the OpenAI Gym dataset, which consists of a set of environments for reinforcement learning. -
Arcade Learning Environment (ALE) dataset
The dataset used in the paper is the Arcade Learning Environment (ALE) dataset, which consists of 57 classic Atari games. -
Playground environment
The Playground environment is a continuous 2D world. In each episode, N = 3 objects are uniformly sampled from a set of 32 different object types (e.g. dog, cactus, sofa, water,...