You're currently viewing an old version of this dataset. To see the current version, click here.

CMB-Exam

A large-scale Chinese benchmark for evaluating medical large language models. The dataset consists of 280,839 samples, with 74 tasks, and covers 24 departments and 150 diseases.

Data and Resources

Cite this as

Junling Liu, Peilin Zhou, Yining Hua, Dading Chong, Zhongyu Tian, Andrew Liu, Helin Wang, Chenyu You, Zhenhua Guo, Lei Zhu (2024). Dataset: CMB-Exam. https://doi.org/10.57702/06b9aln5

DOI retrieved: December 3, 2024

Additional Info

Field Value
Created December 3, 2024
Last update December 3, 2024
Defined In https://doi.org/10.48550/arXiv.2406.13890
Author Junling Liu
More Authors
Peilin Zhou
Yining Hua
Dading Chong
Zhongyu Tian
Andrew Liu
Helin Wang
Chenyu You
Zhenhua Guo
Lei Zhu