Reinforcement Learning - Groups

Relay Policy Learning

Relay policy learning: Solving long-horizon tasks via imitation and reinforcement learning.
- Dataset
- JSON
Reinforcement Learning for (Mixed) Integer Programming: Smart Feasibility Pump

Mixed integer programming (MIP) problems with a linear objective, linear constraints, and integral constraints.
- Dataset
- JSON
Fine-tuning Language Models with Advantage-Induced Policy Alignment

The dataset used in the paper is the Anthropic Helpfulness and Harmlessness dataset and the StackExchange dataset.
- Dataset
- JSON
Gymnasium

Gymnasium: A library for reinforcement learning
- Dataset
- JSON
PyFlyt

PyFlyt - uav flight simulator gymnasium environments for reinforcement learning research
- Dataset
- JSON
Automated Driving Dataset

The dataset used in the paper for automated driving, including scenarios with occluded intersections and merging.
- Dataset
- JSON
Towards True Lossless Sparse Communication in Multi-Agent Systems

The dataset used in the paper is a multi-agent reinforcement learning environment, where agents need to communicate with each other to achieve their goals.
- Dataset
- JSON
DSAC-C: Constrained Maximum Entropy for Robust Discrete Soft-Actor Critic

DSAC-C: Constrained Maximum Entropy for Robust Discrete Soft-Actor Critic
- Dataset
- JSON
MuJoCo Environments with Noise Augmentation

The dataset used in the paper is a set of MuJoCo environments with noise augmentation.
- Dataset
- JSON
DMLab-30

DMLab-30 is a benchmark for multitask reinforcement learning in partially observable environments.
- Dataset
- JSON
Car Racing game dataset

The dataset used in this paper is the Car Racing game dataset, which consists of pixel frames of a car racing game.
- Dataset
- JSON
OpenAI Gym Environment dataset

The dataset used in this paper is the OpenAI Gym Environment dataset, which consists of various games and environments.
- Dataset
- JSON
Atari 2600 games dataset

The dataset used in this paper is the Atari 2600 games dataset, which consists of 50 Atari 2600 games.
- Dataset
- JSON
RLBench

The dataset used in the paper is RLBench, a standard benchmark for vision-based robotics which has been shown to serve as a proxy for real-robot experiments.
- Dataset
- JSON
UNAS: Differentiable Architecture Search Meets Reinforcement Learning

UNAS: Differentiable Architecture Search Meets Reinforcement Learning
- Dataset
- JSON
ProcGen

The dataset used in the paper is a procedurally generated environment called ProcGen.
- Dataset
- JSON
Continual World

The Continual World benchmark consists of ten realistic robotic manipulation tasks.
- Dataset
- JSON
Minigrid

Minigrid environment, a grid-based problem in reinforcement learning.
- Dataset
- JSON
ML4H Findings Track Collection: Machine Learning for Health (ML4H) 2023

A synthetic dataset for training a family of Reinforcement Learning (RL) methods to build explainable pathways for the differential diagnosis of anemia, as a primary use case.
- Dataset
- JSON
REBEL

REBEL is a dataset for reward regularization based robotic reinforcement learning from human feedback.
- Dataset
- JSON

328 datasets found