harmless/harmful anchor datasets

This dataset contains 100 harmless and 100 harmful anchor prompts for evaluating the performance of large language models.

Data and Resources

Original Metadata (JSON)
The JSON representation of the dataset and its distributions, based on DCAT.

Zheng et al. (2024). Dataset: harmless/harmful anchor datasets. https://doi.org/10.57702/g83nrnnd

DOI retrieved: December 16, 2024