ALI-Agent: Assessing LLMs' Alignment with Human Values via Agent-based Evaluation

ALI-Agent is an evaluation framework that leverages the autonomous abilities of LLM-powered agents to probe adaptive and long-tail risks in target LLMs.

BibTex: