Concurrent Learning of Policy and Unknown Safety Constraints in Reinforcement Learning
The dataset used in this paper is a case-study dataset covering three environments: Safe Navigation-Circle, Safe Navigation-Goal, and Safe Velocity-Half Cheetah.
BibTeX: