LibriLight: A Benchmark for ASR with Limited or No Supervision

The LibriLight dataset is a large-scale speech corpus for benchmarking automatic speech recognition (ASR) with limited or no supervision, and is used for self-supervised speech recognition tasks.

Cite this as

Alexei Baevski, Henry Zhou, Abdelrahman Mohamed, Michael Auli (2025). Dataset: LibriLight: A Benchmark for ASR with Limited or No Supervision. https://doi.org/10.57702/1y5slbfc

DOI retrieved: January 2, 2025

Additional Info

Field: Value
Created: January 2, 2025
Last update: January 2, 2025
Defined In: https://doi.org/10.48550/arXiv.2310.02382
Author: Alexei Baevski
More Authors: Henry Zhou, Abdelrahman Mohamed, Michael Auli
Homepage: https://catalog.libris.org/dataset/librilight-a-benchmark-for-asr-with-limited-or-no-supervision