Jailbreak Attacks - Groups

SafeDecoding dataset

The dataset used in the SafeDecoding paper, which contains 32 harmful queries spanning 16 harmful categories.

Dataset
JSON

Comprehensive Assessment of Jailbreak Attacks against LLMs

The Comprehensive Assessment of Jailbreak Attacks against LLMs dataset is used to evaluate the effectiveness of jailbreak attacks on language models.

Dataset
JSON

Forbidden Question Dataset

The dataset used to evaluate the effectiveness of different jailbreak attack methods against LLMs. The dataset contains 160 forbidden questions with high diversity.

Dataset
JSON

Jailbreak Attack Dataset

The dataset used in the paper to evaluate the effectiveness of different jailbreak attack methods against Large Language Models (LLMs).

Dataset
JSON

4 datasets found

SafeDecoding dataset

Comprehensive Assessment of Jailbreak Attacks against LLMs

Forbidden Question Dataset

Jailbreak Attack Dataset