Libri-Light

doi:10.57702/r1l8l07l

Libri-Light is a large-scale corpus of mostly unlabelled English speech drawn from LibriVox audiobooks, the same source as LibriSpeech. In the paper, the authors used this dataset to pre-train their proposed dual-mode ASR model.

Data and Resources

Original Metadata (JSON)
The json representation of the dataset with its distributions based on DCAT.

Cite this as

Sang-Gil Lee, Heeseung Kim, Chaehun Shin, Xu Tan, Chang Liu, Qi Meng, Tao Qin, Wei Chen, Sungroh Yoon (2024). Dataset: Libri-Light. https://doi.org/10.57702/r1l8l07l

DOI retrieved: December 3, 2024

Additional Info

Field	Value
Created	December 3, 2024
Last update	December 3, 2024
Defined In	https://doi.org/10.48550/arXiv.2205.15370
Citation	https://doi.org/10.48550/arXiv.2010.11481 https://doi.org/10.48550/arXiv.2207.11906
Author	Sang-Gil Lee
More Authors	Heeseung Kim, Chaehun Shin, Xu Tan, Chang Liu, Qi Meng, Tao Qin, Wei Chen, Sungroh Yoon
Homepage	https://github.com/Google-Lab-MTL/PyTorch-CTC