Information Retrieval - Groups

Wikipedia Corpus

The dataset used in the paper is a subset of the Wikipedia corpus, consisting of 7500 English Wikipedia articles belonging to one of the following categories: People, Cities,...

Dataset
JSON

Wikipedia dataset

The dataset used in the paper is the Wikipedia dataset, which contains over six million English Wikipedia articles with a full-text field associated with 50 training queries...

Dataset
JSON

20NewsGroups

The dataset used in this paper is a collection of documents from various domains, including news, articles, and emails.

Dataset
JSON

3 datasets found

Wikipedia Corpus

Wikipedia dataset

20NewsGroups