6 datasets found

Formats: JSON

Filter Results
  • CLEF 2017 e-Health Lab Task 2

    The dataset used for the experiments originated from the CLEF 2017 e-Health Lab Task 2 “Technology Assisted Reviews in Empirical Medicine”.
  • CORD-19

    The CORD-19 dataset contains academic journal articles relating to a variety of coronaviruses and related viral infections, not only COVID-19, sourced from PubMed Central (PMC),...
  • Gorilla

    The Gorilla system, a fine-tuned LLaMA model with additional capabilities to retrieve documents and integrate this information during both training and inference.
  • TREC 2019 Document Ranking (TREC2019 Document)

    Dense retrieval (DR) has shown promising results in information retrieval. In essence, DR requires high-quality text representations to support effective search in the...
  • MS MARCO Document Ranking (MARCO Dev Document)

    Dense retrieval (DR) has shown promising results in information retrieval. In essence, DR requires high-quality text representations to support effective search in the...
  • TREC DL

    TREC 2019 Deep Learning Track has the same training and dev set as MS MARCO, but replaces the test set with a novel set produced by TREC.