ALCUNA: Large Language Models Meet New Knowledge

ALCUNA is a benchmark for evaluating the ability of large language models (LLMs) to handle new knowledge.

Data and Resources

Original MetadataJSON
The json representation of the dataset with its distributions based on DCAT.
Explore
- Preview
- Download

Xunjian Yin, Baizhou Huang, Xiaojun Wan (2024). Dataset: ALCUNA: Large Language Models Meet New Knowledge. https://doi.org/10.57702/l8pqb4xg

DOI retrieved: December 16, 2024