DEFENDER: DTW-Based Episode Filtering Using Demonstrations for Enhancing RL Safety

The dataset used in this paper is a set of demonstrations for reinforcement learning, containing safe and unsafe trajectories.

BibTex: