- CBBQ: A Chinese Bias Benchmark Dataset Curated with Human-AI Collaboration fo...
CBBQ is a Chinese Bias Benchmark dataset curated with Human-AI Collaboration for Large Language Models. It consists of over 100K questions jointly constructed by human experts...
- Wikipedia Neutrality Corpus
This dataset is used to test the ability of large language models to detect and correct biased Wikipedia edits according to Wikipedia's Neutral Point of View (NPOV) policy.