ALCUNA: Large Language Models Meet New Knowledge

ALCUNA is a benchmark for evaluating the ability of large language models (LLMs) to handle new knowledge.

Data and Resources

Cite this as

Xunjian Yin, Baizhou Huang, Xiaojun Wan (2024). Dataset: ALCUNA: Large Language Models Meet New Knowledge. https://doi.org/10.57702/l8pqb4xg

DOI retrieved: December 16, 2024

Additional Info

Field Value
Created December 16, 2024
Last update December 16, 2024
Defined In https://doi.org/10.48550/arXiv.2310.14820
Author Xunjian Yin
More Authors
Baizhou Huang
Xiaojun Wan