-
DCG-MAP-Elites-AI
The dataset used in this paper is a set of seven continuous control locomotion tasks implemented in Brax, derived from standard RL benchmarks. -
Pong and Breakout
The dataset used in the paper is a collection of tasks from OpenAI Gym, specifically Pong and Breakout. -
HalfCheetah-v3
The dataset used in the paper is a modified version of the HalfCheetah-v3 environment from the OpenAI gym. -
FrozenLake-v0
The dataset used in the paper is a modified version of the FrozenLake-v0 environment from the OpenAI gym. -
PPO Cartpole Task
PPO Cartpole Task -
Euler Equations Dataset
The dataset used in the paper for testing the performance of the trained policy on the Euler equations. -
Backpropagation Through Time and Space
The dataset used in the paper for training a recurrent spatio-temporal network for hyperbolic conservation laws. -
Traffic Light Control Dataset
The traffic light control dataset is used to evaluate the performance of reinforcement learning models in traffic light control. -
HERON: Hierarchical Preference-based Reinforcement Learning
The dataset used in the paper is not explicitly described, but it is mentioned that the authors used a hierarchical reward design framework to train policies in various... -
Four Rooms domain
The Four Rooms domain is a classic reinforcement learning environment where an agent must navigate a grid world to reach one of four goals. -
Quantifying Multimodality in World Models
Multimodality in World Models -
Reinforcement Learning for UAV Autonomous Navigation, Mapping and Target Dete...
The dataset is used for target detection and mapping problem, performed by a UAV (i.e., the agent), which autonomously navigates an indoor environment. -
DSSE: a Drone Swarm Search Environment
A Drone Swarm Search environment, based on PETTINGZOO, that is to be used in conjunction with multi-agent (or single-agent) reinforcement learning algorithms. -
Tensor and Matrix Low-Rank Value-Function Approximation in Reinforcement Lear...
Value-function (VF) approximation is a central problem in Reinforcement Learning (RL). Classical non-parametric VF estimation suffers from the curse of dimensionality. As a... -
Multi-Stage Gridworld
The dataset used in the paper is the Multi-Stage Gridworld environment, which is a 3D environment that requires the agent to navigate through a gridworld to find a goal. -
Treasure Mountain
The dataset used in the paper is the Treasure Mountain environment, which is a 2D environment that requires the agent to navigate through a mountain to find a treasure. -
Inverted Pendulum
The dataset used in the paper is an Inverted Pendulum dataset, which is a standard benchmark system in control and reinforcement learning. -
ProcGen Maze and Leaper
The ProcGen Maze and Leaper environments are procedurally generated game environments used to benchmark reinforcement learning. -
MineRL-v0 dataset
The MineRL-v0 dataset contains human demonstration data for tasks in Minecraft. -
MineRL BASALT competition
The MineRL BASALT competition dataset contains human demonstration data for four tasks in Minecraft.