Dataset - LDM

IWSLT2018 Speech Translation Task

The dataset used in the paper is the IWSLT2018 speech translation task, which consists of five parts: TED corpus, Speech-translation TED corpus, TED LIUM corpus, WMT18 data and...
- Dataset
- JSON
TED2012 ASR and MT dataset

The dataset used in the paper is a collection of English ASR hypotheses from the eight submissions on the tst2012 test set in the IWSLT 2013 TED talk ASR track, along with...
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

2 datasets found