Document Retrieval - Groups

CLEF 2017 e-Health Lab Task 2

The dataset used for the experiments originated from the CLEF 2017 e-Health Lab Task 2 “Technology Assisted Reviews in Empirical Medicine”.

Dataset
JSON

CORD-19

The CORD-19 dataset contains academic journal articles relating to a variety of coronaviruses and related viral infections, not only COVID-19, sourced from PubMed Central (PMC),...

Dataset
JSON

Gorilla

The Gorilla system, a fine-tuned LLaMA model with additional capabilities to retrieve documents and integrate this information during both training and inference.

Dataset
JSON

TREC 2019 Document Ranking (TREC2019 Document)

Dense retrieval (DR) has shown promising results in information retrieval. In essence, DR requires high-quality text representations to support eﬀective search in the...

Dataset
JSON

MS MARCO Document Ranking (MARCO Dev Document)