- Archive Distillation
The archive A contains policies parameterized by deep neural networks and trained via PPGA, a state-of-the-art QD-RL method.
- Generating Behaviorally Diverse Policies with Latent Diffusion Models
Quality Diversity (QD) is an emerging field in which collections of high-performing, behaviorally diverse solutions are trained. The foundational method, MAP-Elites, maintains...
- Adding Conditional Control to Diffusion Models with Reinforcement Learning
Diffusion models are powerful generative models that allow for precise control over the characteristics of the generated samples. While these diffusion models trained on large...