-
Hungry Geese
The Hungry Geese environment is a discrete environment where four geese are placed on a 7x11 grid, and the goal is to collect food while avoiding collisions with other geese. -
Vehicle Routing Problem
The Vehicle Routing Problem (VRP) is a classic problem in combinatorial optimization. The problem is to find the shortest route that visits each node in a graph exactly once and... -
Newsvendor
The Newsvendor problem is a classic problem in inventory management. The problem is to determine the optimal order quantity to satisfy uncertain demand. -
Bin Packing
The Bin Packing problem is a classic problem in Operations Research and Computer Science. The problem is to pack items of different sizes into bins of fixed capacity. -
ORL: Reinforcement Learning Benchmarks for Online Stochastic Optimization Pro...
Reinforcement Learning (RL) has achieved state-of-the-art results in domains such as robotics and games. We build on this previous work by applying RL algorithms to a selection... -
Comparison Datasets for Imitation Learning and Reinforcement Learning
Comparison datasets for Imitation Learning and Reinforcement Learning. -
RILe: Reinforced Imitation Learning
Reinforced Imitation Learning (RILe) dataset, which consists of expert demonstrations and noisy expert data. -
StarCraft Multi-Agent Challenge (SMAC)
The dataset used in the paper is the StarCraft Multi-Agent Challenge (SMAC) environment. -
Multiple Domain Cyberspace Attack and Defense Game Model
The dataset used in the paper is a multiple domain cyberspace attack and defense game model based on reinforcement learning. -
An End-to-End Deep RL Framework for Task Arrangement in Crowdsourcing Platforms
A Deep Reinforcement Learning framework for task arrangement in crowdsourcing platforms. -
BabyAI-PickUpDist-v0
The dataset used in the paper is the BabyAI environment 'BabyAI-PickUpDist-v0' with a one-pickup-per-episode wrapper. -
SHARING LIFELONG REINFORCEMENT LEARNING KNOWLEDGE VIA MODULATING MASKS
The CT-graph and Minigrid environments are used to evaluate lifelong reinforcement learning approaches. -
Atari 2600 domain
The Atari 2600 domain dataset, used for training and testing reinforcement learning algorithms. -
Real-World RL Challenge
The dataset used in the paper is the Real-World RL Challenge dataset. It contains a set of continuous control tasks. -
DeepMind Control Suite and Real-World RL Experiments
The dataset used in the paper is the DeepMind Control Suite and Real-World RL Experiments. It contains a set of continuous control tasks based on MuJoCo.