Samanantar: The Largest Publicly Available Parallel Corpora Collection for 11 Indic Languages
Data and Resources
- Original Metadata (JSON)
The JSON representation of the dataset and its distributions, based on DCAT.
Cite this as
Ramesh, Doddapaneni, Bheemaraj, Jobanputra, AK, Sharma, Sahoo, Diddee, J, Kakwani, Kumar, Pradeep, Deepak, Raghavan, Kunchukuttan, Kumar, Khapra (2024). Dataset: Samanantar: The Largest Publicly Available Parallel Corpora Collection for 11 Indic Languages. https://doi.org/10.57702/eajzxmsy
DOI retrieved: December 16, 2024
Additional Info
Field | Value |
---|---|
Created | December 16, 2024 |
Last update | December 16, 2024 |
Defined In | https://doi.org/10.48550/arXiv.2111.11815 |
Author | Ramesh |
More Authors | |
Homepage | arXiv:2104.05596 |