CBBQ: A Chinese Bias Benchmark Dataset Curated with Human-AI Collaboration for Large Language Models

CBBQ is a Chinese bias benchmark dataset for large language models, curated through human-AI collaboration. It contains over 100K questions jointly constructed by human experts and generative language models, covering 14 social dimensions related to Chinese culture and values.

Cite this as

Yufei Huang, Deyi Xiong (2025). Dataset: CBBQ: A Chinese Bias Benchmark Dataset Curated with Human-AI Collaboration for Large Language Models. https://doi.org/10.57702/ebvshhdy

DOI retrieved: January 3, 2025

Additional Info

Field         Value
Created       January 3, 2025
Last update   January 3, 2025
Defined In    https://doi.org/10.48550/arXiv.2306.16244
Author        Yufei Huang
More Authors  Deyi Xiong
Homepage      https://github.com/YFHuangxxxx/CBBQ