WanJuan: A Comprehensive Multimodal Dataset for Advancing English and Chinese Large Models

WanJuan: A comprehensive multimodal dataset for advancing English and Chinese large models.

BibTex: