Xiezhi: An Ever-Updating Benchmark for Holistic Domain Knowledge Evaluation

New Natural Language Processing (NLP) benchmarks are urgently needed to keep pace with the rapid development of large language models (LLMs). We present Xiezhi, the most comprehensive evaluation suite designed to assess holistic domain knowledge.

Cite this as

Zhouhong Gu, Xiaoxuan Zhu, Haoning Ye, Lin Zhang, Jianchen Wang, Yixin Zhu, Sihang Jiang, Zhuozhi Xiong, Zihan Li, Weijie Wu, Qianyu He, Rui Xu, Wenhao Huang, Jingping Liu, Zili Wang, Shusen Wang, Weiguo Zheng, Hongwei Feng, Yanghua Xiao (2024). Dataset: Xiezhi: An Ever-Updating Benchmark for Holistic Domain Knowledge Evaluation. https://doi.org/10.57702/a0jcr8w3

DOI retrieved: December 2, 2024

Additional Info

Created: December 2, 2024
Last update: December 2, 2024
Defined In: https://doi.org/10.48550/arXiv.2306.05783
Link to ORKG: http://orkg.org/orkg/resource/R642975
Author: Zhouhong Gu
More Authors: Xiaoxuan Zhu, Haoning Ye, Lin Zhang, Jianchen Wang, Yixin Zhu, Sihang Jiang, Zhuozhi Xiong, Zihan Li, Weijie Wu, Qianyu He, Rui Xu, Wenhao Huang, Jingping Liu, Zili Wang, Shusen Wang, Weiguo Zheng, Hongwei Feng, Yanghua Xiao
Homepage: https://github.com/MikeGu721/XiezhiBenchmark