Open Subtitles dataset

The Open Subtitles dataset consists of transcriptions of spoken dialog in movies and television shows.

Data and Resources

Cite this as

Pavel Sountsov, Sunita Sarawagi (2024). Dataset: Open Subtitles dataset. https://doi.org/10.57702/g079m54y

DOI retrieved: December 17, 2024

Additional Info

Field Value
Created December 17, 2024
Last update December 17, 2024
Defined In https://doi.org/10.48550/arXiv.1606.03402
Author Pavel Sountsov
More Authors
Sunita Sarawagi
Homepage https://opus.ling.ups.edu.pl/OpenSubtitles