Dataset - LDM

AdvBench dataset

The dataset used for the experiments in the paper, consisting of 60 harmful instructions from the AdvBench dataset.
- Dataset
- JSON

AdvBench

The dataset used in the paper to test the Gradient Cuff method for detecting jailbreak attacks on large language models.
- Dataset
- JSON

You can also access this registry using the API (see API Docs).
