ALI-Agent: Assessing LLMs' Alignment with Human Values via Agent-based Evaluation
ALI-Agent is an evaluation framework that leverages the autonomous abilities of LLM-powered agents to probe adaptive and long-tail risks in target LLMs.