-
Hindi-English Code-Switched Sentences
The dataset used in the paper is a collection of Hindi-English code-switched sentences. -
NEWS 2010 English-Hindi test set
The NEWS 2010 English-Hindi test set is used for transliteration equivalence evaluation. -
NEWS 2009 English-Hindi training set
The NEWS 2009 English-Hindi training set is used for transliteration equivalence learning. -
Mujadia et al. dataset
Manually annotated dataset for coreference resolution in Hindi. -
VisualGenome datasets
The VisualGenome datasets containing Bengali, Hindi, and Malayalam sentences for fine-tuning. -
Patrika Dataset
Patrika dataset is used as independent test set. -
Hindinews and Livehindustan Articles
Hindinews, Livehindustan and Patrika newspaper articles available open source in Kaggle encompassing similar domains. -
Bengali and Hindi News Articles
Bengali dataset consists of articles from online public news portals such as Prothom-Alo, BDNews24 and Nayadiganta. The articles encompass domains such as politics,...