The paper uses a multi-agent reinforcement learning dataset in which agents learn to coordinate their actions to achieve a common goal.
Pommerman, a 2-vs-2 game environment introduced in 2018, is a modified version of the popular game 'Bomberman'. It is a good testbed for RL because the environment...