-
DCG-MAP-Elites-AI
The dataset used in this paper is a set of seven continuous control locomotion tasks implemented in Brax, derived from standard RL benchmarks. -
Archive Distillation
The archive A contains policies parameterized by deep neural networks and trained via a state of the art QD-RL method PPGA. -
Generating Behaviorally Diverse Policies with Latent Diffusion Models
Quality Diversity (QD) is an emerging field in which collections of high performing, behaviorally diverse solutions are trained. The foundational method, Map Elites, maintains...