ALCUNA: Large Language Models Meet New Knowledge

ALCUNA is a benchmark for evaluating the ability of large language models (LLMs) to handle new knowledge.

BibTex: