State-wise Constrained Policy Optimization
State-wise Constrained Policy Optimization (SCPO) is a general-purpose policy search algorithm for state-wise constrained reinforcement learning.
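To make "state-wise constrained" concrete, here is a minimal sketch that contrasts it with a cumulative cost budget. It assumes the common reading that a state-wise constraint must hold at every step of a trajectory, whereas a cumulative constraint only bounds the total cost. The function names, the cost values, and the thresholds are hypothetical illustrations; this is not the SCPO algorithm itself.

```python
from typing import Sequence


def satisfies_cumulative(costs: Sequence[float], budget: float) -> bool:
    """Cumulative check (assumed): total cost over the trajectory is within budget."""
    return sum(costs) <= budget


def satisfies_statewise(costs: Sequence[float], threshold: float) -> bool:
    """State-wise check (assumed): the cost at every single step is within threshold."""
    return all(c <= threshold for c in costs)


# Hypothetical per-step costs for one trajectory.
step_costs = [0.0, 0.25, 0.75, 0.0]

cumulative_ok = satisfies_cumulative(step_costs, budget=1.5)
statewise_ok = satisfies_statewise(step_costs, threshold=0.5)

print(cumulative_ok)  # total cost 1.0 is within the budget of 1.5
print(statewise_ok)   # the step with cost 0.75 exceeds the threshold of 0.5
```

In this sketch the same trajectory passes the cumulative check but fails the state-wise one, since a single step can violate the per-step threshold even when the total stays small.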
Linear Functions with Dynamic Uniform Constraints
The dataset used in the paper is the Linear Functions with Dynamic Uniform Constraints problem, a constrained optimization problem where the goal is to maximize a linear...