Soft Actor-Critic

Soft Actor-Critic (SAC) is an off-policy actor-critic algorithm for maximum entropy deep reinforcement learning.
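
To make "maximum entropy" concrete, here is a minimal sketch of the entropy-regularized TD target that a critic in this family is commonly trained toward. It is an illustration under stated assumptions, not this repository's implementation: the function name `soft_td_target`, its parameters, the twin-critic minimum, and the default `gamma` and `alpha` values are all hypothetical choices made for the example.

```python
def soft_td_target(reward, done, next_q1, next_q2, next_log_prob,
                   gamma=0.99, alpha=0.2):
    """Entropy-regularized TD target for a single transition.

    All names and defaults here are illustrative assumptions.
    next_log_prob is log pi(a'|s') for an action a' sampled from the
    current policy at the next state.
    """
    # Assumed twin-critic variant: use the smaller of two Q estimates
    # to reduce overestimation.
    next_q = min(next_q1, next_q2)
    # Soft state value: Q minus the temperature-weighted log-probability,
    # so less likely (higher-entropy) actions receive a bonus.
    soft_value = next_q - alpha * next_log_prob
    # Do not bootstrap past a terminal state.
    return reward + gamma * (1.0 - float(done)) * soft_value


if __name__ == "__main__":
    print(soft_td_target(reward=1.0, done=False,
                         next_q1=2.0, next_q2=3.0,
                         next_log_prob=-1.0, gamma=0.5, alpha=0.2))
```

The temperature `alpha` sets the trade-off between reward and entropy: with `alpha=0` the target reduces to an ordinary TD target.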

BibTeX: