Generated Training Dataset for Biomedical NLU

The dataset consists of user utterances for querying Electronic Health Records (EHRs) in the biomedical domain, generated using templates and augmented with paraphrases. A total of 178 manually annotated questions served as a gold standard.

Cite this as

Antoine Neuraz, Anita Burgun, Leonardo Campillos Llanos, Sophie Rosset (2024). Dataset: Generated Training Dataset for Biomedical NLU. https://doi.org/10.57702/35cpgdvx

DOI retrieved: November 25, 2024

Additional Info

Created: November 25, 2024
Last update: November 25, 2024
Defined In: https://doi.org/10.48550/arXiv.1811.09417
Author: Antoine Neuraz
More Authors: Anita Burgun, Leonardo Campillos Llanos, Sophie Rosset