-
Euler Equations Dataset
The dataset used in the paper for testing the performance of the trained policy on the Euler equations. -
Backpropagation Through Time and Space
The dataset used in the paper for training a recurrent spatio-temporal network for hyperbolic conservation laws. -
Traffic Light Control Dataset
The traffic light control dataset is used to evaluate the performance of reinforcement learning models in traffic light control. -
HERON: Hierarchical Preference-based Reinforcement Learning
The dataset used in the paper is not explicitly described, but it is mentioned that the authors used a hierarchical reward design framework to train policies in various... -
Four Rooms domain
The Four Rooms domain is a classic reinforcement learning environment where an agent must navigate a grid world to reach one of four goals. -
Wield: Systematic Reinforcement Learning With Progressive Randomization
Wield is a system for systematic task design and evaluation in applied RL. It provides a set of reusable software primitives to decouple system interface and RL representation... -
Simulated Self-Driving Car Environment
The dataset used in the paper is a simulated self-driving car environment, a mountain car environment, and a robotic environment. -
Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Le...
The dataset used in the paper is a collection of expert demonstrations for various tasks, including robotic manipulation, maze navigation, and Atari games. -
Diagnosing Bottlenecks in Deep Q-Learning Algorithms
The dataset used in the paper is a collection of expert demonstrations for various tasks, including robotic manipulation, maze navigation, and Atari games. -
Posterior Sampling for Reinforcement Learning
The dataset used in the paper is a random finite horizon Markov decision process (MDP) with states S, actions A, and horizon τ. -
Mirror-Reversal and Rotation Tasks
The dataset used in the paper is a set of mirror-reversal and rotation tasks, used to test the performance of different reinforcement learning algorithms. -
Simulated Arm Reaching Task
The dataset used in the paper is a simulated biomechanical model of the human arm, used to test the performance of different reinforcement learning algorithms. -
Quantifying Multimodality in World Models
Multimodality in World Models -
OpenAI Gym’s Mujoco benchmark
The dataset used in this paper is a set of demonstrations for reinforcement learning, containing safe and unsafe trajectories. -
DEFENDER: DTW-Based Episode Filtering Using Demonstrations for Enhancing RL S...
The dataset used in this paper is a set of demonstrations for reinforcement learning, containing safe and unsafe trajectories. -
Reward-Sharing Relational Networks in Multi-Agent Reinforcement Learning
Reward-Sharing Relational Networks in Multi-Agent Reinforcement Learning as a Framework for Emergent Behavior -
Cart-pole problem dataset
The dataset used for the cart-pole problem is a finite set of states: S, a finite set of actions: A, a state transition probability matrix, P, a reward function R, and a... -
DSSE: a Drone Swarm Search Environment
A Drone Swarm Search environment, based on PETTINGZOO, that is to be used in conjunction with multi-agent (or single-agent) reinforcement learning algorithms.