-
OpenAI Gym dataset
The dataset used in the paper is the OpenAI Gym dataset, which consists of a set of environments for reinforcement learning. -
Arcade Learning Environment (ALE) dataset
The dataset used in the paper is the Arcade Learning Environment (ALE) dataset, which consists of 57 classic Atari games. -
Interaction Networks
Interaction Networks: Using a Reinforcement Learner to train other Machine Learning algorithms -
Playground environment
The Playground environment is a continuous 2D world. In each episode, N = 3 objects are uniformly sampled from a set of 32 different object types (e.g. dog, cactus, sofa, water,... -
Sample-Efficient Robust Multi-Agent Reinforcement Learning in the Face of Env...
To overcome the sim-to-real gap in reinforcement learning (RL), learned policies must maintain robustness against environmental uncertainties. While robust RL has been widely... -
Towards Socially and Morally Aware RL agent: Reward Design With LLM
The 2D Grid World environment with various items and consequences -
Cartpole-v1
The Cartpole-v1 environment is used to evaluate the performance of Federated Reinforcement Distillation (FRD) framework. -
Visualizing MuZero Models
MuZero, a model-based reinforcement learning algorithm that uses a value equivalent dynamics model. -
Google Research Football
The Google Research Football environment is a reinforcement learning experimental platform focused on training agents to play football. -
Deterministic Policy Gradients With General State Transitions
The authors used the ComplexPoint-v0, Pendulum-v0, LunarLanderContinuous-v2, Swimmer-v2, HalfCheetah-v2, HumanoidStandup-v2, Humanoid-v2 datasets for experiments. -
Super Mario Bros
The dataset used in the Generative Adversarial Exploration for Reinforcement Learning paper. -
Atari 2600
The dataset used in the paper is the Atari 2600 dataset, which consists of 49 games. The dataset is used to test the Successor Uncertainties algorithm. -
CartPole, Pendulum, and LunarLander
The dataset used in the paper is a set of environments for reinforcement learning, including CartPole, Pendulum, and LunarLander. -
Autohedger dataset
The dataset is used to train and test the autohedger model.