-
DEFENDER: DTW-Based Episode Filtering Using Demonstrations for Enhancing RL S...
The dataset used in this paper is a set of demonstrations for reinforcement learning, containing safe and unsafe trajectories. -
Wearable Data Collection System for Studying Micro-Level E-Scooter Behavior
A wearable data collection system for studying micro-level e-Scooter behavior in a naturalistic road environment. -
RealToxicityPrompts
RealToxicityPrompts constitutes a collection of 100k naturally occurring sentences, amassed from various internet sources and designed to function as LM prompts. -
PKU-SafeRLHF dataset
The dataset used in the paper is the PKU-SafeRLHF dataset.