-
SemEval-2020 Task 4: Commonsense Validation and Explanation (ComVE)
The dataset for SemEval-2020 Task 4: Commonsense Validation and Explanation (ComVE) consists of 10 sentences: two similar sentences and three options each. -
Blood Glucose Prediction Dataset
A dataset of 24 past blood glucose values, used for predicting future blood glucose values. -
Smart card dataset of bus ridership
Three-month smart card dataset of bus ridership, containing over 10 million observations allied with detailed weather measurements, trip length, calendar events, and built... -
Wildfire Risk Prediction
Wildfire risk prediction dataset -
CAp 2018 dataset
The dataset used in the CAp 2018 competition for language level prediction. -
AllenCell dataset
The AllenCell dataset contains 12 subcellular structures (i.e., ActinFilament, ActomBundle, CellMembrane, Desmosome, DNA, EndopReticulum, GolgiApparatus, Microtubule,... -
Bankruptcy Prediction Dataset
The dataset used for bankruptcy prediction, containing explanatory variables from 33 research papers and the Freddie Mac single-family loan-level dataset. -
SPREADSHEETCODER: Formula Prediction from Semi-structured Context
Spreadsheet formula prediction has been an im-portant program synthesis problem with many real-world applications. Previous works typically utilize input-output examples as the... -
Predicting Football Match Outcomes with eXplainable Machine Learning and the ...
The Premier League match data covering the 2019-2021 seasons. -
Cellular traffic analysis
A real traffic dataset generated from a real user using deep machine learning as well as statistical learning. -
CASP12 dataset
The CASP12 dataset is a benchmark for protein structure prediction. -
Temporal Data Mining
The dataset used in this study consists of three clinical datasets: oncology, hepatitis, and diabetes. -
HumanAct12
HumanAct12 dataset is a large-scale 3D human motion dataset with textual descriptions.