-
Robomimic Environment
Robomimic environment consists of tasks such as lift, can, square, tool-hang, and transport. -
D4RL Benchmark Suite
D4RL benchmark suite consists of tasks such as locomotion, antmaze, adroit, and kitchen. -
Action-Quantized Offline Reinforcement Learning for Robotic Skill Learning
Offline reinforcement learning (RL) paradigm provides a general recipe to convert static behavior datasets into policies that can perform better than the policy that collected... -
Random Walk dataset
The dataset used in the paper is a collection of states sampled from a Markov Decision Process (MDP) using the random walk exploration method. -
RND dataset
The dataset used in the paper is a collection of states sampled from a Markov Decision Process (MDP) using the RND exploration method. -
SMM dataset
The dataset used in the paper is a collection of states sampled from a Markov Decision Process (MDP) using the SMM exploration method. -
ChronoGEM dataset
The dataset used in the paper is a collection of states sampled from a Markov Decision Process (MDP) using the ChronoGEM exploration method. -
Gridworld Dataset
The dataset used for the Gridworld tasks, consisting of 10K episodes of the agent following a uniform random policy. -
OpenAI Gym dataset
The dataset used in the paper is the OpenAI Gym dataset, which consists of a set of environments for reinforcement learning. -
Arcade Learning Environment (ALE) dataset
The dataset used in the paper is the Arcade Learning Environment (ALE) dataset, which consists of 57 classic Atari games. -
Interaction Networks
Interaction Networks: Using a Reinforcement Learner to train other Machine Learning algorithms -
Playground environment
The Playground environment is a continuous 2D world. In each episode, N = 3 objects are uniformly sampled from a set of 32 different object types (e.g. dog, cactus, sofa, water,... -
Sample-Efficient Robust Multi-Agent Reinforcement Learning in the Face of Env...
To overcome the sim-to-real gap in reinforcement learning (RL), learned policies must maintain robustness against environmental uncertainties. While robust RL has been widely... -
Towards Socially and Morally Aware RL agent: Reward Design With LLM
The 2D Grid World environment with various items and consequences -
Cartpole-v1
The Cartpole-v1 environment is used to evaluate the performance of Federated Reinforcement Distillation (FRD) framework. -
Visualizing MuZero Models
MuZero, a model-based reinforcement learning algorithm that uses a value equivalent dynamics model. -
Google Research Football
The Google Research Football environment is a reinforcement learning experimental platform focused on training agents to play football.