3 datasets found

Tags: jailbreak

  • Jailbreak Prompts and Malicious Queries

    The dataset comprises 448 in-the-wild jailbreak prompts and 161 malicious queries, with which the authors derived a systemization of five categories and ten unique jailbreak...
  • Forbidden Question Dataset

    This dataset is used to evaluate the effectiveness of different jailbreak attack methods against LLMs. It contains 160 highly diverse forbidden questions.
  • Jailbreak Attack Dataset

    This dataset is used in the paper to evaluate the effectiveness of different jailbreak attack methods against Large Language Models (LLMs).
You can also access this registry using the API (see API Docs).