You're currently viewing an old version of this dataset. To see the current version, click here.

Speech Commands

The Speech Commands dataset consists of 105809 one-second audio recordings of 35 spoken words sampled at 16kHz. The raw speech commands dataset presents audio recordings as a sequence of 16000 samples for speech classification.

Data and Resources

This dataset has no data

Cite this as

Ji Lin, Wei-Ming Chen, Yujun Lin, John Cohn, Chuang Gan, Song Han (2024). Dataset: Speech Commands. https://doi.org/10.57702/4caa1t6e

Private DOI This DOI is not yet resolvable.
It is available for use in manuscripts, and will be published when the Dataset is made public.

Additional Info

Field	Value
Created	December 2, 2024
Last update	December 2, 2024
Defined In	https://doi.org/10.48550/arXiv.2206.03398
Citation	https://doi.org/10.48550/arXiv.2402.14989 https://doi.org/10.48550/arXiv.2010.10682 https://doi.org/10.48550/arXiv.2104.07916
Author	Ji Lin
More Authors	Wei-Ming Chen Yujun Lin John Cohn Chuang Gan Song Han
Homepage	https://arxiv.org/abs/1804.03209