State-wise Constrained Policy Optimization

State-wise Constrained Policy Optimization (SCPO) is a general-purpose policy search algorithm for state-wise constrained reinforcement learning.

BibTex: