Finnish-Estonian Parallel Data

A bilingual corpus created by triangulating English–Finnish and English–Estonian parallel data, resulting in a set of 679,252 sentence pairs used to extract cognates and improve multilingual translation.

Data and Resources

Cite this as

Stig-Arne Grönroos, Sami Virpioja, Mikko Kurimo (2024). Dataset: Finnish-Estonian Parallel Data. https://doi.org/10.57702/jgbm7r7n

DOI retrieved: November 25, 2024

Additional Info

Field Value
Created November 25, 2024
Last update November 25, 2024
Defined In https://doi.org/10.48550/arXiv.1808.10791
Author Stig-Arne Grönroos
More Authors
Sami Virpioja
Mikko Kurimo