Libri-Light

The dataset used in the paper is Libri-Light, a corpus of roughly 60,000 hours of unlabelled English speech drawn from LibriVox audiobooks, the same source as LibriSpeech. The authors used it to pre-train their proposed dual-mode ASR model.

Cite this as

Sang-Gil Lee, Heeseung Kim, Chaehun Shin, Xu Tan, Chang Liu, Qi Meng, Tao Qin, Wei Chen, Sungroh Yoon (2024). Dataset: Libri-Light. https://doi.org/10.57702/r1l8l07l

DOI retrieved: December 3, 2024

Additional Info

Field          Value
Created        December 3, 2024
Last update    December 3, 2024
Defined In     https://doi.org/10.48550/arXiv.2205.15370
Citation       • https://doi.org/10.48550/arXiv.2010.11481
               • https://doi.org/10.48550/arXiv.2207.11906
Author         Sang-Gil Lee
More Authors   Heeseung Kim, Chaehun Shin, Xu Tan, Chang Liu, Qi Meng, Tao Qin, Wei Chen, Sungroh Yoon
Homepage       https://github.com/Google-Lab-MTL/PyTorch-CTC