Dataset - LDM

Visualizing MuZero Models

MuZero is a model-based reinforcement learning algorithm that uses a value-equivalent dynamics model.
- Dataset
- JSON
Google Research Football

The Google Research Football environment is a reinforcement learning experimental platform focused on training agents to play football.
- Dataset
- JSON
ViZDoom

The dataset is used for training a neural network consisting of 2 blocks of convolutional layers followed by max pooling layers and an LSTM block.
- Dataset
- JSON
Super Mario Bros

The dataset used in the Generative Adversarial Exploration for Reinforcement Learning paper.
- Dataset
- JSON
Chain MDP

The dataset used in the Generative Adversarial Exploration for Reinforcement Learning paper.
- Dataset
- JSON
Atari 2600

The dataset used in the paper is the Atari 2600 dataset, which consists of 49 games. The dataset is used to test the Successor Uncertainties algorithm.
- Dataset
- JSON
CartPole, Pendulum, and LunarLander

The dataset used in the paper is a set of environments for reinforcement learning, including CartPole, Pendulum, and LunarLander.
- Dataset
- JSON
Dreamer

The dataset used for training the Dreamer model.
- Dataset
- JSON
MineCLIP

The MineCLIP dataset is a large-scale dataset of Minecraft demonstrations.
- Dataset
- JSON
dm_control

The dataset used for training agents in the dm_control environment.
- Dataset
- JSON
GenRL

The dataset used in the paper is not explicitly described, but it is mentioned that the authors used a combination of reinforcement learning and generative models to solve...
- Dataset
- JSON
PatchAttack: A Black-box Texture-based Attack with Reinforcement Learning

Patch-based attacks introduce a perceptible but localized change to the input that induces misclassification. A limitation of current patch-based black-box attacks is that they...
- Dataset
- JSON
BSuite

The dataset used in the paper is not explicitly described, but it is mentioned that the authors used the BSuite environment for the Umbrella-Length task.
- Dataset
- JSON
Direct preference optimization: Your language model is secretly a reward model

The dataset used in the paper is not explicitly described. However, it is mentioned that the authors used a language model to optimize the performance of a reinforcement...
- Dataset
- JSON
Metadrive: Composing diverse driving scenarios for generalizable reinforcemen...

The dataset used in the paper is Metadrive, a driving simulator.
- Dataset
- JSON
LightZero: A unified benchmark for Monte Carlo Tree Search in general sequent...

The dataset used in the paper is not explicitly described. However, it is mentioned that the authors used Atari environments and board games to evaluate the proposed algorithm.
- Dataset
- JSON
Markov Decision Process

The dataset used in the paper is a Markov Decision Process in which states take values in a state space X; in a state x ∈ X, we can take an action u ∈ U,...
- Dataset
- JSON
DeepMind Control Suite

The DeepMind Control Suite is a collection of 20 robotic manipulation tasks, each with 5 different environments and 5 different robot parameters. The tasks are designed to test...
- Dataset
- JSON
Wizard

The Wizard dataset is a trick-taking game with 4 players, consisting of 15 rounds. Each round consists of a dealing, bidding, playing, and evaluation phase.
- Dataset
- JSON
Atari57

The dataset used in the paper is Atari57.
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

198 datasets found