Jailbreak Attack Dataset

This dataset was used in the authors' paper to evaluate the effectiveness of different jailbreak attack methods against large language models (LLMs).

Cite this as

Junjie Chu, Yugeng Liu, Ziqing Yang, Xinyue Shen, Michael Backes, Yang Zhang (2024). Dataset: Jailbreak Attack Dataset. https://doi.org/10.57702/wlykmymo

DOI retrieved: December 2, 2024

Additional Info

Created: December 2, 2024
Last update: December 2, 2024
Author: Junjie Chu
More authors: Yugeng Liu, Ziqing Yang, Xinyue Shen, Michael Backes, Yang Zhang