Reinforcement Learning - Groups

Replay Buffer

The dataset used in the paper is a replay buffer containing observations from a navigation task.

Dataset
JSON

MuJoCo Environment

The dataset used in the paper is a MuJoCo environment, with 13-states and 4-control inputs, nonlinear dynamics with polynomial dependency in the control inputs.

Dataset
JSON

Corridor Environment

The corridor environment is a simple environment where the agent has to determine whether the rewarding cell (colored yellow) is at the top or bottom, based on the color of the...

Dataset
JSON

Guard: A safe reinforcement learning benchmark

The dataset used in the paper is a collection of robot locomotion tasks with various constraints.

Dataset
JSON

State-wise Constrained Policy Optimization

State-wise Constrained Policy Optimization (SCPO) is a general-purpose policy search algorithm for state-wise constrained reinforcement learning.

Dataset
JSON

NeoRL

A near real-world benchmark for ofﬂine RL, which contains datasets from various domains with controlled sizes, and extra test datasets for policy validation.

Dataset
JSON

Defense Against Reward Poisoning Attacks in Reinforcement Learning

We study defense strategies against reward poisoning attacks in reinforcement learning.

Dataset
JSON

A Neuromorphic Architecture for Reinforcement Learning from Real-Valued Obser...

The proposed network contains clustering layers, based on earlier work by Afshar et al., 2020 and Bethi et al., 2022, with an introduction of TD-error modulation and eligibility...

Dataset
JSON

On the Theory of Reinforcement Learning

The dataset is used to study a theory of reinforcement learning (RL) in which the learner receives binary feedback only once at the end of an episode.

Dataset
JSON

HandManipulateBlock

The HandManipulateBlock environment from OpenAI gym robotics suite

Dataset
JSON

FetchPickAndPlace and HandManipulateBlock

The FetchPickAndPlace and HandManipulateBlock environments from OpenAI gym robotics suite

Dataset
JSON

FetchPush, FetchPickAndPlace and HandManipulateBlock

The FetchPush, FetchPickAndPlace and HandManipulateBlock environments from OpenAI gym robotics suite

Dataset
JSON

Dense Reward for Free in RLHF

The dataset used in the paper is not explicitly described, but it is mentioned that it is a preference dataset for language models.

Dataset
JSON

Funnel board

The Funnel board task is a domain where a ball falls through a grid of obstacles onto one of five platforms. Every other row of obstacles consists of funnel-shaped objects,...

Dataset
JSON

Room runner

The Room runner task is a domain where an agent moves through a randomly generated map of rooms, which are observed in 2D from above. The agent follows the policy of always...

Dataset
JSON

Event Camera-based Reinforcement Learning

The dataset used in the paper is a simulated environment for event camera-based reinforcement learning. The dataset includes a car-like robot equipped with an event camera, and...

Dataset
JSON

UNIMALS

The dataset used in the paper is a collection of robot morphologies, each with a unique set of actuators and joints.

Dataset
JSON

Alchemy: A structured task distribution for meta-reinforcement learning

The Alchemy benchmark is a meta-learning environment rich enough to contain interesting abstractions, yet simple enough to make ne-grained analysis tractable.

Dataset
JSON

Sym-Q: Adaptive Symbolic Regression via Sequential Decision-Making

Symbolic regression holds great potential for uncovering underlying mathematical and physical relationships from empirical data. The authors introduce Symbolic Q-network...

Dataset
JSON

Multiple-confounded-Mujoco-Envs

The dataset used in the paper is a collection of environments with multiple confounders, including mass, length, damping, and a crippled leg. The dataset is used to evaluate the...

Dataset
JSON

100 datasets found