-
VoxCeleb dataset
The VoxCeleb dataset is a large-scale speaker identification dataset, used to evaluate the performance of speaker recognition systems. -
TIMIT dataset
TIMIT is a corpus of read American English speech with time-aligned phonetic transcriptions. The paper uses it to study a phonetically and phonologically local allophonic distribution in English, where voiceless stops surface as aspirated... -
KFTT datasets
The KFTT (Kyoto Free Translation Task) datasets are used for English↔Japanese translation. -
NIST 2003 (MT03), NIST 2004 (MT04), NIST 2005 (MT05), NIST 2006 (MT06) datasets
NIST evaluation sets for Chinese↔English translation tasks, used in the paper alongside the KFTT English↔Japanese translation datasets. -
WIT corpus, SETimes corpus, newsdev2016, newstest2016, and newstest2017
The paper uses the WIT corpus, the SETimes corpus, newsdev2016, newstest2016, and newstest2017. -
Turkish-English and Uyghur-Chinese machine translation tasks
The paper uses data from the Turkish-English and Uyghur-Chinese machine translation tasks. -
IWSLT 2014
The IWSLT 2014 German-to-English dataset is a machine translation dataset, containing 153K sentence pairs. -
English Test Set
The English test set is used for evaluating the performance of the proposed system. -
LibriSpeech dataset
The dataset used in the paper is the LibriSpeech dataset, which contains about 1,000 hours of English speech derived from audiobooks.