MCV-10

doi:doi:10.57702/69wrnvpu

MCV-10

This work showcases a cost-effective method for generating training data for speech processing tasks. The dataset MCV-10 is a multilingual dataset that contains 50 hours of training data.

Data and Resources

Original MetadataJSON
The json representation of the dataset with its distributions based on DCAT.
Explore
- Preview
- Download

Cite this as

Taras Sereda (2024). Dataset: MCV-10. https://doi.org/10.57702/69wrnvpu

DOI retrieved: December 16, 2024

Additional Info

Field	Value
Created	December 16, 2024
Last update	December 16, 2024
Defined In	https://doi.org/10.48550/arXiv.2406.12674
Author	Taras Sereda
Homepage	https://huggingface.co/datasets/taras-sereda/uk-pods-conformer