UKWaC and Wackypedia corpora

The dataset used in this paper is a large text corpus compiled from UKWaC and Wackypedia corpora.

BibTex: