-
Representation Learning with Autoencoders for Electronic Health Records: A Co...
An Electronic Health Records (EHR) dataset used for predictive modeling and feature representation learning. -
Diabetic Cohort
The dataset used in this paper is a diabetic cohort from the same hospital, containing 2,840 diabetic admissions. -
Heart Failure Cohort
The dataset used in this paper is a heart failure cohort from an Australian hospital, containing 1,885 heart failure admissions. -
MIMIC-CXR-JPG
The MIMIC-CXR-JPG dataset comprises 227,835 imaging studies of 64,588 patients who sought treatment at the BIDMC Emergency Department from 2011 to 2016. -
NFPC dataset
The NFPC dataset is a population-based health survey dataset. This dataset contains detailed personal characteristics of both spouses, which are mainly divided into the... -
OptumLabs Clinical Discovery Database
The dataset used in this study is composed of 1.4 million individuals, with 1.2 billion laboratory test results, covering 47 laboratory-tested human physical components. -
TREC-COVID
The TREC-COVID dataset is a collection of journal articles related to COVID-19 and other coronaviruses, with human annotators providing relevancy judgments at the end of each... -
ICU Length of Stay Dataset
The dataset used in this paper is a healthcare dataset containing information about ICU length of stay for patients who have undergone cardiac surgery. -
Onco-Retriever: Generative Classifier for Retrieval of EHR Records in Oncology
The dataset is used for training and evaluating the Onco-Retriever model, a generative classifier for retrieval of EHR records in oncology. -
Breast Cancer Wisconsin (Original) dataset
The dataset used in the paper is the Breast Cancer Wisconsin (Original) dataset, which contains 699 entries, 9 dimensions, and 2 classes. -
Pima Indian diabetes dataset
The dataset used in the paper is the Pima Indian diabetes dataset, which contains 768 entries, 8 dimensions, and 2 classes. -
Pima Indians Diabetes Database
The Pima Indians Diabetes Database contains samples from females in the Pima Indian population near Phoenix, Arizona. -
PhysioNet 2012
The dataset is used in this paper for healthcare data democratization and information leakage prevention. -
MIMIC-III-full-label
Prediction of medical codes from clinical notes is both a practical and essential need for every healthcare delivery organization within current medical systems. -
SUPPORT Dataset
The SUPPORT dataset is a large collection of patient data used for survival analysis. It contains information on demographics, physiological measurements, and outcomes for... -
auton-survival
Applications of machine learning in healthcare often require working with time-to-event prediction tasks including prognostication of an adverse event, re-hospitalization, and... -
UK Biobank dataset
The UK Biobank dataset consists of SAX and LAX cine CMR images of normal subjects. Cardiac structures, LV cavity (LVC), LV myocardium (LVM), and right-ventricle cavity (RVC)... -
Dutch Academic Hospital Dataset
The Dutch Academic Hospital Dataset is a publicly available dataset from a hospital in the Netherlands, released through the Business Process Intelligence (BPI) Challenge in 2011.