-
TransMuCoRes
Translated dataset for Multilingual Coreference Resolution (TransMuCoRes) in 31 South Asian languages. -
DBP15KZH-EN, DBP15KJA-EN, and DBP15KFR-EN datasets
The DBP15KZH-EN, DBP15KJA-EN, and DBP15KFR-EN datasets are used for cross-lingual entity alignment. The datasets contain entities, relations, and attributes, and are used to... -
Multilingual Text Classification Dataset
Multilingual text classification dataset with 17 different languages -
Robust Multilingual Named Entity Recognition with Shallow Semi-Supervised Fea...
Multilingual Named Entity Recognition approach based on a robust and general set of features across languages and datasets. -
UD Dataset v2.7
The UD Dataset v2.7 is a multilingual dataset for dependency parsing. -
Parallel Meaning Bank
A semantically annotated parallel corpus for English, German, Italian, and Dutch where sentences are aligned with scoped meaning representations in order to capture the... -
WikiMatrix
The WikiMatrix dataset is a multilingual dataset that contains parallel texts between English and other languages. -
Very Deep Multilingual Convolutional Neural Networks for LVCSR
Convolutional neural networks (CNNs) are a standard component of many current state-of-the-art Large Vocabulary Continuous Speech Recognition (LVCSR) systems. However, CNNs in...