Dataset - LDM

Bipedal Walker, Acrobot, and Continuous Lunar Lander tasks

The dataset used in this paper is a reinforcement learning benchmark problem, specifically the Bipedal Walker, Acrobot, and Continuous Lunar Lander tasks.
- Dataset
- JSON
Constraint Sampling Reinforcement Learning

The dataset used in the paper is a set of environments for reinforcement learning, including movie recommendations, educational activities sequencing, and HIV treatment.
- Dataset
- JSON
BRIDGE dataset

The BRIDGE dataset is a collection of 155 deterministic MDPs, each with a horizon of 100 time steps. The dataset is used to evaluate the performance of reinforcement learning...
- Dataset
- JSON
DeepMind Control Suite and PyBullet Environments

The dataset used in this paper is the DeepMind Control Suite and PyBullet Environments.
- Dataset
- JSON
The Arcade Learning Environment: An Evaluation Platform for General Agents

The Arcade Learning Environment (ALE) is a lasting and indispensable element of the RL researcher’s toolbox. It is also the focus of our work. Since its inception, hundreds of...
- Dataset
- JSON
Visual Grid World Environment and TextWorld domain

The dataset used in the paper is a Visual Grid World Environment and the TextWorld domain.
- Dataset
- JSON
Archive Distillation

The archive A contains policies parameterized by deep neural networks and trained via a state of the art QD-RL method PPGA.
- Dataset
- JSON
Generating Behaviorally Diverse Policies with Latent Diffusion Models

Quality Diversity (QD) is an emerging field in which collections of high performing, behaviorally diverse solutions are trained. The foundational method, Map Elites, maintains...
- Dataset
- JSON
OpenAI Gym and Atari games

The dataset used in the paper is not explicitly described, but it is mentioned that the authors conducted experiments on several representative tasks from the OpenAI Gym and...
- Dataset
- JSON
Relay Policy Learning

Relay policy learning: Solving long-horizon tasks via imitation and reinforcement learning.
- Dataset
- JSON
Reinforcement Learning for (Mixed) Integer Programming: Smart Feasibility Pump

Mixed integer programming (MIP) problems with a linear objective, linear constraints, and integral constraints.
- Dataset
- JSON
Fine-tuning Language Models with Advantage-Induced Policy Alignment

The dataset used in the paper is the Anthropic Helpfulness and Harmlessness dataset and the StackExchange dataset.
- Dataset
- JSON
Gymnasium

Gymnasium: A library for reinforcement learning
- Dataset
- JSON
PyFlyt

PyFlyt - uav flight simulator gymnasium environments for reinforcement learning research
- Dataset
- JSON
Automated Driving Dataset

The dataset used in the paper for automated driving, including scenarios with occluded intersections and merging.
- Dataset
- JSON
Towards True Lossless Sparse Communication in Multi-Agent Systems

The dataset used in the paper is a multi-agent reinforcement learning environment, where agents need to communicate with each other to achieve their goals.
- Dataset
- JSON
DSAC-C: Constrained Maximum Entropy for Robust Discrete Soft-Actor Critic

DSAC-C: Constrained Maximum Entropy for Robust Discrete Soft-Actor Critic
- Dataset
- JSON
MuJoCo Environments with Noise Augmentation

The dataset used in the paper is a set of MuJoCo environments with noise augmentation.
- Dataset
- JSON
DMLab-30

DMLab-30 is a benchmark for multitask reinforcement learning in partially observable environments.
- Dataset
- JSON
Car Racing game dataset

The dataset used in this paper is the Car Racing game dataset, which consists of pixel frames of a car racing game.
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

397 datasets found