-
Multilingual dataset
A high-quality dataset in English and 12 other languages, augmented with rhyme schema at the paragraph level. -
BABEL-Pashto
The BABEL-Pashto dataset is a multilingual speech recognition dataset containing Pashto speech recordings. -
Pali: A Jointly-Scaled Multilingual Language-Image Model
This paper proposes a method called Pali, which jointly scales visual and vision-language representation learning. -
TransMuCoRes
Translated dataset for Multilingual Coreference Resolution (TransMuCoRes) in 31 South Asian languages. -
Massively multilingual neural machine translation
The dataset used for multilingual neural machine translation. -
Very Deep Multilingual Convolutional Neural Networks for LVCSR
Convolutional neural networks (CNNs) are a standard component of many current state-of-the-art Large Vocabulary Continuous Speech Recognition (LVCSR) systems. However, CNNs in... -
CommonVoice
The sequence-to-sequence approach is widely used in speech recognition (SR) nowadays, and many research works are dedicated to show that their capabilities relying on a single...