Speech Commands

The Speech Commands dataset consists of 105809 one-second audio recordings of 35 spoken words, sampled at 16 kHz. In its raw form, each recording is a sequence of 16000 samples, and the task is speech classification over the 35 words.
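The sample count follows directly from the figures in the description: one second at 16 kHz gives 16000 samples per recording. A minimal sketch of that arithmetic in Python is shown below. The helper `fix_length` is hypothetical and is not part of the dataset or any library; zero-padding or truncating clips to a fixed length is an assumption about how uniform 16000-sample inputs might be obtained, not something this page describes.

```python
# Figures taken from the dataset description.
SAMPLE_RATE_HZ = 16_000   # 16 kHz sampling rate
CLIP_SECONDS = 1          # one-second recordings
NUM_CLASSES = 35          # 35 spoken words

# Samples per recording: 16000.
CLIP_SAMPLES = SAMPLE_RATE_HZ * CLIP_SECONDS


def fix_length(samples, target=CLIP_SAMPLES):
    """Hypothetical helper (an assumption, not from the dataset):
    truncate or zero-pad a sequence of samples to exactly `target` values."""
    samples = list(samples)
    if len(samples) >= target:
        return samples[:target]
    return samples + [0.0] * (target - len(samples))


# Usage: a clip that is shorter than one second is padded with zeros.
short_clip = [0.1] * 12_000
fixed = fix_length(short_clip)
print(len(fixed), CLIP_SAMPLES)
```

A plain list keeps the sketch dependency-free; in practice an array type would likely be used for the samples.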

Data and Resources

This dataset has no data

Cite this as

Ji Lin, Wei-Ming Chen, Yujun Lin, John Cohn, Chuang Gan, Song Han (2024). Dataset: Speech Commands. https://doi.org/10.57702/4caa1t6e

Private DOI: This DOI is not yet resolvable. It is available for use in manuscripts and will be published when the dataset is made public.

Additional Info

Field          Value
Created        December 2, 2024
Last update    December 2, 2024
Defined In     https://doi.org/10.48550/arXiv.2206.03398
Citation       • https://doi.org/10.48550/arXiv.2402.14989
               • https://doi.org/10.48550/arXiv.2010.10682
               • https://doi.org/10.48550/arXiv.2104.07916
Author         Ji Lin
More Authors   Wei-Ming Chen, Yujun Lin, John Cohn, Chuang Gan, Song Han
Homepage       https://arxiv.org/abs/1804.03209