2 datasets found

Tags: attack

  • Forbidden Question Dataset

    A dataset for evaluating the effectiveness of different jailbreak attack methods against large language models (LLMs). It contains 160 highly diverse forbidden questions.
  • Jailbreak Attack Dataset

    The dataset used in the paper to evaluate the effectiveness of different jailbreak attack methods against large language models (LLMs).