You're currently viewing an old version of this dataset.

MuST-C v1.0

MuST-C v1.0 is a multilingual corpus for end-to-end speech translation, containing 8 language pairs.

Cite this as

Sara Papi, Marco Turchi, Matteo Negri (2025). Dataset: MuST-C v1.0. https://doi.org/10.57702/cv4smnhx

DOI retrieved: January 3, 2025

Additional Info

Created: January 3, 2025
Last update: January 3, 2025
Defined In: https://doi.org/10.21437/Interspeech.2023-170
Author: Sara Papi
More Authors: Marco Turchi, Matteo Negri
Homepage: https://github.com/hlt-mt/fbk-fairseq