2 datasets found
Groups: Information Retrieval
Organizations: No Organization
Formats: JSON

Wikimarks
The Wikimarks dataset consists of 30 million deduplicated paragraphs from all Wikipedia articles.
Format: JSON

TREC-CAR Benchmark Y1
This dataset was used for the Retrieve-Cluster-Summarize system and consists of 117 article-level queries and 126 test queries.
Format: JSON