SMIX(λ): Enhancing Centralized Value

SMIX(λ) is a method for cooperative multi-agent reinforcement learning that uses an off-policy training approach to estimate a centralized value function.

BibTex: