Datasets Activity Stream About Order by Relevance Name Ascending Name Descending Last Modified Go 2 datasets found Tags: Question Answering Filter Results Disc-medllm Disc-medllm: Bridging general large language models and real-world medical consultation. Dataset JSON CMB-Exam A large-scale Chinese benchmark for evaluating medical large language models. The dataset consists of 280,839 samples, with 74 tasks, and covers 24 departments and 150 diseases. Dataset JSON