DISCO: A Large Scale Human Annotated Corpus for Disfluency Correction

A large-scale human-annotated corpus for disfluency correction in four Indo-European languages: English, Hindi, German, and French.

Data and Resources

Cite this as

Vineet Bhat, Preethi Jyothi, Pushpak Bhattacharyya (2025). Dataset: DISCO: A Large Scale Human Annotated Corpus for Disfluency Correction. https://doi.org/10.57702/x0kdc6zw

DOI retrieved: January 3, 2025

Additional Info

Field Value
Created January 3, 2025
Last update January 3, 2025
Defined In https://doi.org/10.48550/arXiv.2310.16749
Author Vineet Bhat
More Authors
Preethi Jyothi
Pushpak Bhattacharyya
Homepage https://github.com/vineet2104/DISCO