-
SciMT-Safety
The SciMT-Safety dataset is a benchmark for evaluating the safety of AI systems in science. It consists of hundreds of refined red-teaming queries that span the fields of...
-
SAIBench: A Structural Interpretation of AI for Science Through Benchmarks
This dataset is used for benchmarking machine learning force fields (MLFF) and jet tagging.