2 datasets found

Groups: Information Retrieval Organizations: No Organization Formats: JSON

Filter Results
  • Wikimarks

    The Wikimarks dataset, which consists of 30 million deduplicated paragraphs from all Wikipedia articles.
  • TREC-CAR Benchmark Y1

    The dataset used for the Retrieve-Cluster-Summarize system, consisting of 117 article-level queries and 126 test queries.