MCV-10

This work showcases a cost-effective method for generating training data for speech processing tasks. The dataset MCV-10 is a multilingual dataset that contains 50 hours of training data.

Data and Resources

Cite this as

Taras Sereda (2024). Dataset: MCV-10. https://doi.org/10.57702/69wrnvpu

DOI retrieved: December 16, 2024

Additional Info

Field Value
Created December 16, 2024
Last update December 16, 2024
Defined In https://doi.org/10.48550/arXiv.2406.12674
Author Taras Sereda
Homepage https://huggingface.co/datasets/taras-sereda/uk-pods-conformer