Generated Training Dataset for Biomedical NLU

doi:10.57702/35cpgdvx

The dataset consists of user utterances for querying Electronic Health Records (EHRs) in the biomedical domain, generated using templates and augmented with paraphrases. A total of 178 manually annotated questions served as a gold standard.
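As a rough illustration of template-based generation, the sketch below fills slots in a few utterance templates with every combination of slot values. The templates, slot names, and values are hypothetical and are not taken from the dataset, and the paraphrase augmentation step is not covered.

```python
import string
from itertools import product

# Hypothetical templates and slot values, invented for illustration only;
# they are not taken from the dataset.
TEMPLATES = [
    "show me patients with {condition}",
    "which patients had {condition} in {year}",
]
SLOT_VALUES = {
    "condition": ["diabetes", "asthma"],
    "year": ["2015", "2016"],
}


def slots_in(template):
    """Return the slot names used in a template, in order of appearance."""
    return [name for _, name, _, _ in string.Formatter().parse(template) if name]


def generate(templates, slot_values):
    """Fill each template with every combination of its slot values."""
    utterances = []
    for template in templates:
        names = slots_in(template)
        for combo in product(*(slot_values[n] for n in names)):
            utterances.append(template.format(**dict(zip(names, combo))))
    return utterances


if __name__ == "__main__":
    for utterance in generate(TEMPLATES, SLOT_VALUES):
        print(utterance)
```

The number of generated utterances grows with the product of the slot value counts for each template, so a small set of templates can yield a large training set.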

Data and Resources

Original Metadata (JSON)
The JSON representation of the dataset and its distributions, based on DCAT.

Cite this as

Antoine Neuraz, Anita Burgun, Leonardo Campillos Llanos, Sophie Rosset (2024). Dataset: Generated Training Dataset for Biomedical NLU. https://doi.org/10.57702/35cpgdvx

DOI retrieved: November 25, 2024

Additional Info

Field	Value
Created	November 25, 2024
Last update	November 25, 2024
Defined In	https://doi.org/10.48550/arXiv.1811.09417
Author	Antoine Neuraz
More Authors	Anita Burgun, Leonardo Campillos Llanos, Sophie Rosset