SciMT-Safety

The SciMT-Safety dataset is a benchmark for evaluating the safety of AI systems in science. It consists of hundreds of refined red-teaming queries that span the fields of chemistry and biology.

BibTex: