-
SafeDecoding dataset
The dataset used in the SafeDecoding paper, which contains 32 harmful queries spanning 16 harmful categories. -
Comprehensive Assessment of Jailbreak Attacks against LLMs
The Comprehensive Assessment of Jailbreak Attacks against LLMs dataset is used to evaluate the effectiveness of jailbreak attacks on language models.