OPUS EMEA Corpus
The dataset was created by collecting an updated version of the European Medicines Agency (EMEA) corpus and reprocessing it with new methods for text extraction from PDF files, sentence splitting, sentence alignment, and parallel corpus filtering.
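To give a sense of what the parallel corpus filtering step can involve, here is a minimal sketch of one common heuristic: dropping aligned sentence pairs that are empty or whose token counts differ too much. It is an illustration under assumptions, not the method used to build this corpus. The function name `filter_pairs` and the thresholds `max_ratio` and `min_tokens` are hypothetical.

```python
def filter_pairs(pairs, max_ratio=3.0, min_tokens=1):
    """Keep (source, target) sentence pairs with plausible length ratios.

    Hypothetical helper: the thresholds are illustrative assumptions,
    not values taken from the corpus documentation.
    """
    kept = []
    for src, tgt in pairs:
        # Whitespace tokenization is a simplification for this sketch.
        n_src = len(src.split())
        n_tgt = len(tgt.split())
        # Drop pairs where either side is empty or too short.
        if n_src < min_tokens or n_tgt < min_tokens:
            continue
        # Drop pairs whose longer side exceeds max_ratio times the shorter.
        ratio = max(n_src, n_tgt) / min(n_src, n_tgt)
        if ratio > max_ratio:
            continue
        kept.append((src, tgt))
    return kept


if __name__ == "__main__":
    sample = [
        ("The medicine is taken twice daily .",
         "Das Arzneimittel wird zweimal täglich eingenommen ."),
        ("Page 3", ""),
        ("Yes", "Dies ist ein sehr langer Satz ohne Entsprechung im Quelltext ."),
    ]
    for src, tgt in filter_pairs(sample):
        print(src, "|||", tgt)
```

A length-ratio check is cheap and catches many misalignments, but it cannot detect pairs of similar length that are not translations of each other, so real pipelines usually combine it with other signals.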
BibTeX: