The dataset used in the paper is the Simple Spread environment, which consists of N agents and N landmarks. The aim of teamed agents is to learn covering all the landmarks while avoiding collisions.
BibTex:
Before browse our site, please accept our cookies policy