Zh-En Multi-Domain Dataset

The Zh-En (Chinese-English) multi-domain dataset consists of four balanced domains: news, patent, subtitles, and COVID-19.

Cite this as

Chunting Zhou, Graham Neubig, Jiatao Gu, Mona Diab, Paco Guzman, Luke Zettlemoyer (2024). Dataset: Zh-En Multi-Domain Dataset. https://doi.org/10.57702/yohbsb3w

DOI retrieved: December 16, 2024

Additional Info

Field: Value
Created: December 16, 2024
Last update: December 16, 2024
Defined In: https://doi.org/10.48550/arXiv.2011.02593
Author: Chunting Zhou
More Authors: Graham Neubig, Jiatao Gu, Mona Diab, Paco Guzman, Luke Zettlemoyer
Homepage: https://github.com/violet-zct/fairseq-detect-hallucination