CBBQ: A Chinese Bias Benchmark Dataset Curated with Human-AI Collaboration for Large Language Models

doi:doi:10.57702/ebvshhdy

CBBQ: A Chinese Bias Benchmark Dataset Curated with Human-AI Collaboration for Large Language Models

CBBQ is a Chinese Bias Benchmark dataset curated with Human-AI Collaboration for Large Language Models. It consists of over 100K questions jointly constructed by human experts and generative language models, covering 14 social dimensions related to Chinese culture and values.

Data and Resources

Original MetadataJSON
The json representation of the dataset with its distributions based on DCAT.
Explore
- Preview
- Download

Cite this as

Yufei Huang, Deyi Xiong (2025). Dataset: CBBQ: A Chinese Bias Benchmark Dataset Curated with Human-AI Collaboration for Large Language Models. https://doi.org/10.57702/ebvshhdy

DOI retrieved: January 3, 2025

Additional Info

Field	Value
Created	January 3, 2025
Last update	January 3, 2025
Defined In	https://doi.org/10.48550/arXiv.2306.16244
Author	Yufei Huang
More Authors	Deyi Xiong
Homepage	https://github.com/YFHuangxxxx/CBBQ