Dataset - LDM

Deep Attention Recurrent Q-Network

The Deep Attention Recurrent Q-Network (DARQN) algorithm was tested on several popular Atari 2600 games: Breakout, Seaquest, Space Invaders, Tutankham, and Gopher.
- Dataset
- JSON
BSuite

The dataset used in the paper is not explicitly described, but it is mentioned that the authors used the BSuite environment for the Umbrella-Length task.
- Dataset
- JSON
Direct preference optimization: Your language model is secretly a reward model

The dataset used in the paper is not explicitly described. However, it is mentioned that the authors used a language model to optimize the performance of a reinforcement...
- Dataset
- JSON
Metadrive: Composing diverse driving scenarios for generalizable reinforcemen...

The dataset used in the paper is Metadrive, a driving simulator.
- Dataset
- JSON
LightZero: A unified benchmark for Monte Carlo Tree Search in general sequent...

The dataset used in the paper is not explicitly described. However, it is mentioned that the authors used Atari environments and board games to evaluate the proposed algorithm.
- Dataset
- JSON
Distributional Reinforcement Learning with Quantile Regression

Distributional reinforcement learning with quantile regression
- Dataset
- JSON
Markov Decision Process

The dataset used in the paper is a Markov Decision Process, where states can take values in a state space X, corresponding to a state x ∈ X, we can take an action u ∈ U,...
- Dataset
- JSON
Meta-World and Robomimic

The dataset used in the paper is a robotic manipulation task dataset, which consists of trajectories and preference labels.
- Dataset
- JSON
DeepMind Control Suite

The DeepMind Control Suite is a collection of 20 robotic manipulation tasks, each with 5 different environments and 5 different robot parameters. The tasks are designed to test...
- Dataset
- JSON
BBRL Activations Dataset

The dataset used in the paper is a collection of activations from a feature extraction network and a reactive network, used to train a Variational Autoencoder (VAE) to learn...
- Dataset
- JSON
Deep Reinforcement Learning Based Controller for Active Heave Compensation

Heave compensation is an essential part in various oﬀshore operations. It is used in various applications, which include on-loading or oﬀ-loading systems, oﬀshore drilling,...
- Dataset
- JSON
DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models

Learning from human feedback has been shown to improve text-to-image models. These techniques first learn a reward function that captures what humans care about in the task and...
- Dataset
- JSON
Atari57

The dataset used in the paper is Atari57.
- Dataset
- JSON
Google Research Football (GRF)

The dataset used in the paper is Google Research Football (GRF) environment.
- Dataset
- JSON
Cart-Pole, Pendulum, and Cart-Pole Balance Environments

The dataset used in this paper is a set of three classic control environments: cart-pole swing-up (CPSU), pendulum swing-up (PSU), and cart-pole balance (CPB).
- Dataset
- JSON
D4RL Benchmark

D4RL benchmark dataset, which consists of four offline logging datasets, collected by different one or mixed behavior policies.
- Dataset
- JSON
Roboschool

The dataset used in the ACE algorithm for continuous control problems.
- Dataset
- JSON
D4RL

D4RL datasets for maze2d-umaze, maze2d-medium, maze2d-large, antmaze-umaze, antmaze-medium, antmaze-large, parking, soccer-sim, and soccer-physical tasks
- Dataset
- JSON
Blocksworld Dataset

The Blocksworld dataset is a photo-realistic environment, where the goal is to move blocks around to achieve a goal state. The dataset consists of 480/2592 possible...
- Dataset
- JSON
8-Puzzle Dataset

The dataset used in the paper is an 8-puzzle environment, where the goal is to solve the puzzle by moving tiles around. The dataset consists of 20000 transition inputs, which...
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

328 datasets found