Soft Actor-Critic With Integer Actions

Reinforcement learning under integer actions by incorporating the Soft Actor-Critic (SAC) algorithm with an integer reparameterization.

BibTex: