- NYT Dataset
The NYT dataset is a collection of articles published between 2012 and 2022.
- Patrika Dataset
The Patrika dataset is used as an independent test set.
- Nayadiganta Dataset
The Nayadiganta dataset is used as an independent test set.
- Hindinews and Livehindustan Articles
Articles from the Hindinews, Livehindustan and Patrika newspapers, available as open-source data on Kaggle and encompassing similar domains.
- Bengali and Hindi News Articles
The Bengali dataset consists of articles from online public news portals such as Prothom-Alo, BDNews24 and Nayadiganta. The articles encompass domains such as politics,...
- News Articles Dataset
The dataset used in this paper is a collection of news articles from an international news website, covering a time span from September 2012 to April 2014.
- 20NewsGroups
The dataset used in this paper is a collection of documents from various domains, including news, articles, and emails.