-
Ro - Fr, Ro - It, Ro - Es, Ro - Pt
The dataset used in this paper is a collection of 400 pairs of cognates and non-cognates for Romanian-French, Romanian-Italian, Romanian-Spanish, and Romanian-Portuguese languages. -
French Wikipedia
French Wikipedia corpus -
PADT, EWT, GSD, HDT, and SynTagRus
PADT, EWT, GSD, HDT, and SynTagRus are UD treebanks. -
Maurdor dataset
Manifold Mixup improves text recognition with CTC loss