You're currently viewing an old version of this dataset. To see the current version, click here.

Feasibility of Post-Editing Speech Transcriptions with a Mismatched Crowd

The dataset consists of synthetic, phonetically proximate options which emulate post-editing scenarios of varying difficulty for five languages: Arabic, German, Hindi, Russian, and Spanish.

Data and Resources

This dataset has no data

Cite this as

Purushotam Radadia, Shirish Karande (2024). Dataset: Feasibility of Post-Editing Speech Transcriptions with a Mismatched Crowd. https://doi.org/10.57702/xu2jwru9

Private DOI This DOI is not yet resolvable.
It is available for use in manuscripts, and will be published when the Dataset is made public.

Additional Info

Field	Value
Created	December 3, 2024
Last update	December 3, 2024
Defined In	https://doi.org/10.48550/arXiv.1609.02043
Author	Purushotam Radadia
More Authors	Shirish Karande