-
Chinese–Japanese Unsupervised Neural Machine Translation Using Sub-character ...
Chinese–Japanese Unsupervised Neural Machine Translation Using Sub-character Level Information -
Aozorabunko dataset
Aozorabunko dataset used for pre-training of PnG BERT model. -
Wikipedia2 and Aozorabunko datasets
Wikipedia2 and Aozorabunko datasets used for pre-training of PnG BERT model. -
KFTT datasets
KFTT English↔Japanese translation datasets. -
NIST 2003 (MT03), NIST 2004 (MT04), NIST 2005 (MT05), NIST 2006 (MT06) datasets
Chinese↔English translation tasks, KFTT English↔Japanese translation datasets. -
JSUT corpus
The dataset is a large vocabulary Japanese accent dictionary built using the proposed technique.