-
MACHIAVELLI Benchmark
A dataset of traces from the MACHIAVELLI environment, including API calls and their outcomes. -
BELLS: A Framework Towards Future Proof Benchmarks for the Evaluation of LLM ...
A structured collection of tests for input-output safeguards, including established failure tests, emerging failure tests, and next-gen architecture tests.