Dictation dataset

The dictation dataset across 39 locales, including Latin (Albanian, Icelandic, Slovak), Arabic (Levant, Maghrebi), Cyrillic (Macedonian, Kazakh), Devanagari (Nepali), etc.

BibTeX:

@dataset{Junwen_Bai_and_Bo_Li_and_Qiujia_Li_and_Tara_N_Sainath_and_Trevor_Strohman_2024,
  abstract    = {The dictation dataset across 39 locales, including Latin (Albanian, Icelandic, Slovak), Arabic (Levant, Maghrebi), Cyrillic (Macedonian, Kazakh), Devanagari (Nepali), etc.},
  author      = {Junwen Bai and Bo Li and Qiujia Li and Tara N. Sainath and Trevor Strohman},
  doi         = {10.57702/vwgn8jce},
  institution = {No Organization},
  keyword     = {Dictation, Multilingual, Speech Recognition},
  month       = {dec},
  publisher   = {TIB},
  title       = {Dictation dataset},
  url         = {https://service.tib.eu/ldmservice/dataset/dictation-dataset},
  year        = {2024}
}