FLoRes Benchmark

The FLoRes dataset is a benchmark designed for low-resource machine translation. It includes English-to-Nepali translations with approximately 564,000 parallel sentences, making it considerably more challenging due to the language distance.

Data and Resources

Cite this as

Francisco Guzmán, Peng-Jen Chen, Myle Ott, Juan Pino, Guillaume Lample, Philipp Koehn, Vishrav Chaudhary, Marc’Aurelio Ranzato (2024). Dataset: FLoRes Benchmark. https://doi.org/10.57702/l5u8otrx

DOI retrieved: November 25, 2024

Additional Info

Field Value
Created November 25, 2024
Last update November 25, 2024
Defined In https://doi.org/10.48550/arXiv.2004.08053
Author Francisco Guzmán
More Authors
Peng-Jen Chen
Myle Ott
Juan Pino
Guillaume Lample
Philipp Koehn
Vishrav Chaudhary
Marc’Aurelio Ranzato