- Four Rooms
The Four Rooms environment is a stochastic version of the classic Four Rooms gridworld domain. The environment has 104 states and 4 actions, and the agent can move in any of the 4...
- Soft Actor-Critic With Integer Actions
Addresses reinforcement learning with integer actions by combining the Soft Actor-Critic (SAC) algorithm with an integer reparameterization.
- Kuka Object Manipulation Datasets
The dataset is used for training and testing the Equivariant Diffuser for Generating Interactions (EDGI) algorithm.
- Navigation and Manipulation Datasets
The dataset is used for training and testing the Equivariant Diffuser for Generating Interactions (EDGI) algorithm.
- Robomimic Environment
The Robomimic environment consists of tasks such as lift, can, square, tool-hang, and transport.
- D4RL Benchmark Suite
The D4RL benchmark suite consists of tasks such as locomotion, antmaze, adroit, and kitchen.
- Action-Quantized Offline Reinforcement Learning for Robotic Skill Learning
The offline reinforcement learning (RL) paradigm provides a general recipe to convert static behavior datasets into policies that can perform better than the policy that collected...
- Random Walk dataset
The dataset used in the paper is a collection of states sampled from a Markov Decision Process (MDP) using the random walk exploration method.
- RND dataset
The dataset used in the paper is a collection of states sampled from a Markov Decision Process (MDP) using the RND exploration method.
- SMM dataset
The dataset used in the paper is a collection of states sampled from a Markov Decision Process (MDP) using the SMM exploration method.
- ChronoGEM dataset
The dataset used in the paper is a collection of states sampled from a Markov Decision Process (MDP) using the ChronoGEM exploration method.
- Gridworld Dataset
The dataset used for the Gridworld tasks, consisting of 10K episodes of the agent following a uniform random policy.
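
As a rough illustration of how a dataset of episodes collected under a uniform random policy might be generated, the sketch below rolls out random actions in a small gridworld. Everything specific here is an assumption made for the example, not a detail from the listed datasets: the 13x13 layout (a common rendering of the Four Rooms grid, used only as a stand-in), the deterministic moves, the uniform start-state sampling, the horizon, and the names `LAYOUT`, `step`, and `collect_episodes`. With this layout the grid has 104 free cells and 4 actions.

```python
import random

# Hypothetical 13x13 layout: 'w' is a wall, ' ' is a free cell.
LAYOUT = [
    "wwwwwwwwwwwww",
    "w     w     w",
    "w     w     w",
    "w           w",
    "w     w     w",
    "w     w     w",
    "ww wwww     w",
    "w     www www",
    "w     w     w",
    "w     w     w",
    "w           w",
    "w     w     w",
    "wwwwwwwwwwwww",
]

# Four actions as (row delta, column delta): up, down, left, right.
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]

# Every free cell is one state.
FREE_CELLS = [
    (r, c)
    for r, row in enumerate(LAYOUT)
    for c, ch in enumerate(row)
    if ch == " "
]


def step(cell, action):
    """Move one cell in the chosen direction; stay put if a wall blocks the move.

    Transitions are deterministic here (an assumption of this sketch).
    """
    dr, dc = ACTIONS[action]
    r, c = cell[0] + dr, cell[1] + dc
    return (r, c) if LAYOUT[r][c] == " " else cell


def collect_episodes(num_episodes, horizon, seed=0):
    """Roll out a uniform random policy and record (state, action, next_state)."""
    rng = random.Random(seed)
    episodes = []
    for _ in range(num_episodes):
        cell = rng.choice(FREE_CELLS)  # assumed: uniform random start state
        trajectory = []
        for _ in range(horizon):
            action = rng.randrange(len(ACTIONS))  # uniform over the 4 actions
            next_cell = step(cell, action)
            trajectory.append((cell, action, next_cell))
            cell = next_cell
        episodes.append(trajectory)
    return episodes


# Small run for illustration; a 10K-episode dataset would use num_episodes=10_000.
dataset = collect_episodes(num_episodes=100, horizon=20)
```

Seeding a private `random.Random` instance keeps the collection reproducible without touching the global random state.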