2 datasets found
Tags: Continuous Speech Recognition
TIMIT
The TIMIT corpus is a widely used benchmark for speech recognition tasks. It contains 3,696 training utterances from 462 speakers, excluding the SA sentences. The core test set...
WSJ
The WSJ corpus is a large-vocabulary continuous speech recognition dataset. It contains 36,416 sequences, representing around 80 hours of speech.
You can also access this registry using the API (see API Docs).