-
Corpus of Spontaneous Japanese
The Corpus of Spontaneous Japanese: Its design and evaluation [30] is a dataset of spontaneous Japanese speech. -
Text-Level Error Type Classification Criteria
The proposed text-level error type classification criteria, which considers 13 text-level errors that can occur in speech recognition situations. -
Speech-Level Error Type Classification Criteria
The proposed speech-level error type classification criteria, which considers 24 sub-types for noise error and 13 sub-types for speaker characteristics. -
Error Explainable Benchmark (EEB) dataset
The proposed Error Explainable Benchmark (EEB) dataset, which considers both speech- and text-level error types, to diagnose and validate ASR models and post-processors. -
DeepSpeech
The DeepSpeech dataset used for evaluation of the proposed watermarking scheme. -
Speech Pattern Based Black-Box Model Watermarking for Automatic Speech Recogn...
The proposed black-box model watermarking framework for protecting the IP of ASR models. -
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction o...
Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units -
wav2vec: Unsupervised Pre-Training for Speech Recognition
Unsupervised Pre-Training for Speech Recognition -
Speech Intelligibility Prediction with DNN-based Performance Measures
The dataset used for speech intelligibility prediction with DNN-based performance measures -
Transformer based Whisper Bangla ASR model
A transformer-based Whisper Bangla ASR model -
BD-4SK-ASR
The dataset used in this paper is BD-4SK-ASR, an experimental dataset which is used in the first attempt in developing an ASR system for Sorani Kurdish. -
IWSLT2018 Speech Translation Task
The dataset used in the paper is the IWSLT2018 speech translation task, which consists of five parts: TED corpus, Speech-translation TED corpus, TED LIUM corpus, WMT18 data and... -
Wall Street Journal
The Wall Street Journal dataset is used for syntactic linearization. It contains a large corpus of news articles with their corresponding syntactic trees.