-
MaMiC: Macro and Micro Curriculum for Robotic Reinforcement Learning
MaMiC: Macro and Micro Curriculum for Robotic Reinforcement Learning -
Time Limits in Reinforcement Learning
The dataset used in the paper is a reinforcement learning dataset, specifically for time-limited tasks and time-unlimited tasks. -
Theremin Dataset
The dataset used for training a robotic agent to play the theremin instrument using time-dependent goals. -
MiniGrid and BabyAI environments
The dataset used in the paper is a reinforcement learning environment, specifically the MiniGrid and BabyAI environments. -
Gridworld, Torus, and Four Rooms Environments
The dataset used in the paper is a set of environments with different topological properties, including a gridworld, a torus, and a four rooms environment. The agent is tasked... -
Grid World Navigation Task
The dataset used in the paper is a grid world navigation task with four actions: up, down, left, or right. Transitions are stochastic and with a 5% probability the agent moves... -
RL4health: Crowdsourcing Reinforcement Learning for Knee Replacement Pathway ...
The dataset is a collection of full episodes of claims records from hundreds of successful past treatments. -
Multi-Agent Games Using Adaptive Feedback Control
The dataset used in this paper is a set of five game-theoretic tasks (Harmony Game, Hawk-Dove, Stag-Hunt, Prisoners Dilemma and Battle of the Exes) with seven different agent... -
Desk Cleanup environment
The Desk Cleanup environment, which consists of a robotic arm and several blocks on a desk. -
Fetch Manipulation environment with 3 blocks
The Fetch Manipulation environment with 3 blocks, which consists of a robotic arm with a gripper and three blocks. -
Fetch Manipulation environment with 2 blocks
The Fetch Manipulation environment with 2 blocks, which consists of a robotic arm with a gripper and two blocks. -
Fetch Manipulation environment
The Fetch Manipulation environment built on top of Mujoco, which consists of a robotic arm with a gripper and square blocks. -
Breaking the Deadly Triad with a Target Network
The dataset used in the paper "Breaking the Deadly Triad with a Target Network" for training and testing the proposed algorithms. -
Reinforcement Learning with Convex Constraints
The dataset used in the paper is a reinforcement learning problem with arbitrary convex constraints. -
Breakout, SeaQuest, Space Invaders, and Qbert Environments
The dataset used in this work is the Breakout, SeaQuest, Space Invaders, and Qbert environments. -
Atari Environment
The dataset used in this work is the Atari environment in OpenAI Gym, created by the Arcade Learning Environment (ALE).