GigaSpeech

GigaSpeech: An evolving, multi-domain ASR corpus with 10,000 hours of transcribed audio.

Data and Resources

This dataset has no data

Cite this as

G. Chen, S. Chai, G.-B. Wang, J. Du, W.-Q. Zhang, C. Weng, D. Su, D. Povey, J. Trmal, J. Zhang, M. Jin, S. Khudanpur, S. Watanabe, S. Zhao, W. Zou, X. Li, X. Yao, Y. Wang, Z. You, Z. Yan (2024). Dataset: GigaSpeech. https://doi.org/10.57702/h9qwaxti

DOI retrieved: December 16, 2024

Additional Info

Field          Value
Created        December 16, 2024
Last update    December 16, 2024
Defined In     https://doi.org/10.48550/arXiv.2406.17272
Citation       https://doi.org/10.48550/arXiv.2309.14758
               https://doi.org/10.48550/arXiv.2312.09100
Author         G. Chen
More Authors   S. Chai, G.-B. Wang, J. Du, W.-Q. Zhang, C. Weng, D. Su, D. Povey, J. Trmal, J. Zhang, M. Jin, S. Khudanpur, S. Watanabe, S. Zhao, W. Zou, X. Li, X. Yao, Y. Wang, Z. You, Z. Yan
Homepage       https://gigaspeech.org/