3 datasets found
Groups: Speech Translation
Formats: JSON

Europarl-ST
Europarl-ST is a multilingual speech corpus that contains transcriptions of parliamentary debates in multiple languages.
Dataset (JSON)

MuST-C: a Multilingual Speech Translation Corpus
MuST-C is a multilingual speech translation corpus.
Dataset (JSON)

MuST-C
MuST-C is a multilingual speech translation dataset, which contains at least 385 hours of audio recordings from TED Talks, with their manual transcriptions and translations at...
Dataset (JSON)