FLoRes Benchmark

doi:doi:10.57702/l5u8otrx

FLoRes Benchmark

The FLoRes dataset is a benchmark designed for low-resource machine translation. It includes English-to-Nepali translations with approximately 564,000 parallel sentences, making it considerably more challenging due to the language distance.

Data and Resources

Original MetadataJSON
The json representation of the dataset with its distributions based on DCAT.
Explore
- Preview
- Download

Cite this as

Francisco Guzmán, Peng-Jen Chen, Myle Ott, Juan Pino, Guillaume Lample, Philipp Koehn, Vishrav Chaudhary, Marc’Aurelio Ranzato (2024). Dataset: FLoRes Benchmark. https://doi.org/10.57702/l5u8otrx

DOI retrieved: November 25, 2024

Additional Info

Field	Value
Created	November 25, 2024
Last update	November 25, 2024
Defined In	https://doi.org/10.48550/arXiv.2004.08053
Author	Francisco Guzmán
More Authors	Peng-Jen Chen Myle Ott Juan Pino Guillaume Lample Philipp Koehn Vishrav Chaudhary Marc’Aurelio Ranzato