Dataset - LDM

State-wise Constrained Policy Optimization

State-wise Constrained Policy Optimization (SCPO) is a general-purpose policy search algorithm for state-wise constrained reinforcement learning.
- Dataset
- JSON
Pretrained Visual Representations in Reinforcement Learning

Visual reinforcement learning (RL) has made significant progress in recent years, but the choice of visual feature extractor remains a crucial design decision.
- Dataset
- JSON
DRiLLS: Deep Reinforcement Learning for Logic Synthesis

Logic synthesis requires extensive tuning of the synthesis optimization flow where the quality of results (QoR) depends on the sequence of optimizations used. The authors...
- Dataset
- JSON
Interactive Scoring IRL

The dataset used in the paper is a set of trajectories and scores provided by human teachers to train a behavioral policy in a sparse reward environment.
- Dataset
- JSON
MuJoCo Continuous Control Tasks

The dataset used in the paper is a collection of data from the MuJoCo continuous control tasks.
- Dataset
- JSON
NeoRL

A near real-world benchmark for ofﬂine RL, which contains datasets from various domains with controlled sizes, and extra test datasets for policy validation.
- Dataset
- JSON
Defense Against Reward Poisoning Attacks in Reinforcement Learning

We study defense strategies against reward poisoning attacks in reinforcement learning.
- Dataset
- JSON
A Neuromorphic Architecture for Reinforcement Learning from Real-Valued Obser...

The proposed network contains clustering layers, based on earlier work by Afshar et al., 2020 and Bethi et al., 2022, with an introduction of TD-error modulation and eligibility...
- Dataset
- JSON
On the Theory of Reinforcement Learning

The dataset is used to study a theory of reinforcement learning (RL) in which the learner receives binary feedback only once at the end of an episode.
- Dataset
- JSON
HandManipulateBlock

The HandManipulateBlock environment from OpenAI gym robotics suite
- Dataset
- JSON
FetchPickAndPlace and HandManipulateBlock

The FetchPickAndPlace and HandManipulateBlock environments from OpenAI gym robotics suite
- Dataset
- JSON
FetchPush, FetchPickAndPlace and HandManipulateBlock

The FetchPush, FetchPickAndPlace and HandManipulateBlock environments from OpenAI gym robotics suite
- Dataset
- JSON
Dense Reward for Free in RLHF

The dataset used in the paper is not explicitly described, but it is mentioned that it is a preference dataset for language models.
- Dataset
- JSON
SAI Dataset

The dataset used for training the SAI agent, containing 7x7 Go games with multiple komi values.
- Dataset
- JSON
MuJoCo environments

The dataset used in the paper is not explicitly described, but it is mentioned that the authors used MuJoCo environments from the OpenAI gym.
- Dataset
- JSON
OpenAI Gym benchmark

The dataset used in the paper is the OpenAI Gym benchmark, which provides a set of environments for reinforcement learning.
- Dataset
- JSON
Funnel board

The Funnel board task is a domain where a ball falls through a grid of obstacles onto one of five platforms. Every other row of obstacles consists of funnel-shaped objects,...
- Dataset
- JSON
Room runner

The Room runner task is a domain where an agent moves through a randomly generated map of rooms, which are observed in 2D from above. The agent follows the policy of always...
- Dataset
- JSON
Discovering Blind Spots in Reinforcement Learning

The dataset used in the paper is a collection of oracle feedback, which is used to learn a blind spot model of the target world.
- Dataset
- JSON
Event Camera-based Reinforcement Learning

The dataset used in the paper is a simulated environment for event camera-based reinforcement learning. The dataset includes a car-like robot equipped with an event camera, and...
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

328 datasets found