Xiezhi: An Ever-Updating Benchmark for Holistic Domain Knowledge Evaluation

doi:10.57702/a0jcr8w3

New natural language processing (NLP) benchmarks are urgently needed to keep pace with the rapid development of large language models (LLMs). We present Xiezhi, the most comprehensive evaluation suite designed to assess holistic domain knowledge.

Data and Resources

Original Metadata (JSON)
The JSON representation of the dataset and its distributions, based on DCAT.

Cite this as

Zhouhong Gu, Xiaoxuan Zhu, Haoning Ye, Lin Zhang, Jianchen Wang, Yixin Zhu, Sihang Jiang, Zhuozhi Xiong, Zihan Li, Weijie Wu, Qianyu He, Rui Xu, Wenhao Huang, Jingping Liu, Zili Wang, Shusen Wang, Weiguo Zheng, Hongwei Feng, Yanghua Xiao (2024). Dataset: Xiezhi: An Ever-Updating Benchmark for Holistic Domain Knowledge Evaluation. https://doi.org/10.57702/a0jcr8w3

DOI retrieved: December 2, 2024

Additional Info

Field	Value
Created	December 2, 2024
Last update	December 2, 2024
Defined In	https://doi.org/10.48550/arXiv.2306.05783
Link to ORKG	http://orkg.org/orkg/resource/R642975
Author	Zhouhong Gu
More Authors	Xiaoxuan Zhu, Haoning Ye, Lin Zhang, Jianchen Wang, Yixin Zhu, Sihang Jiang, Zhuozhi Xiong, Zihan Li, Weijie Wu, Qianyu He, Rui Xu, Wenhao Huang, Jingping Liu, Zili Wang, Shusen Wang, Weiguo Zheng, Hongwei Feng, Yanghua Xiao
Homepage	https://github.com/MikeGu721/XiezhiBenchmark