-
A Corpus of Turkish Offensive Language on Social Media
The dataset is a collection of Turkish tweets containing offensive language. -
Turkish Tweets Dataset
A collection of Turkish tweets about three different Turkish telecommunication brands gathered over one month. -
WIT corpus, SETimes corpus, newsdev2016, newstest2016, and newstest2017
The dataset used in the paper is the WIT corpus, SETimes corpus, newsdev2016, newstest2016, and newstest2017. -
Turkish-English and Uyghur-Chinese machine translation tasks
The dataset used in the paper is the Turkish-English and Uyghur-Chinese machine translation tasks.