You're currently viewing an old version of this dataset. To see the current version, click here.

Feasibility of Post-Editing Speech Transcriptions with a Mismatched Crowd

The dataset consists of synthetic, phonetically proximate options which emulate post-editing scenarios of varying difficulty for five languages: Arabic, German, Hindi, Russian, and Spanish.

Data and Resources

This dataset has no data

Cite this as

Purushotam Radadia, Shirish Karande (2024). Dataset: Feasibility of Post-Editing Speech Transcriptions with a Mismatched Crowd. https://doi.org/10.57702/xu2jwru9

Private DOI This DOI is not yet resolvable.
It is available for use in manuscripts, and will be published when the Dataset is made public.

Additional Info

Field Value
Created December 3, 2024
Last update December 3, 2024
Defined In https://doi.org/10.48550/arXiv.1609.02043
Author Purushotam Radadia
More Authors
Shirish Karande