2 datasets found

Tags: methods

  • Forbidden Question Dataset

    A dataset for evaluating the effectiveness of different jailbreak attack methods against large language models (LLMs). It contains 160 highly diverse forbidden questions.
  • Jailbreak Attack Dataset

    The dataset used in the paper to evaluate the effectiveness of different jailbreak attack methods against Large Language Models (LLMs).
You can also access this registry using the API (see API Docs).