-
ICLP 2019 Dataset
The dataset contains 122 submissions of abstracts, 92 full submissions, and 30 accepted papers. -
Netflix Dataset
The dataset used in the paper is a Netflix dataset, which is a large-scale matrix factorization problem. -
AMP sequence dataset
The dataset contains 6438 known AMP sequences and 9522 non-AMP sequences from the DBAASP database. -
CRESCI2017
The CRESCI2017 dataset is a collection of Twitter data, including user information, event logs, and tweets. It contains four types of accounts: genuine users, social bots,... -
Taobao Dataset
Precise user modeling is critical for online personalized recommen-der services. Generally, users’ interests are diverse and are not limited to a single aspect, which is... -
Spacecraft Pose Estimation Dataset (SPEED)
The SPEED dataset is used for training and evaluating spacecraft pose estimation algorithms. -
TMID: A Comprehensive Real-world Dataset for Trademark Infringement Detection...
A comprehensive real-world dataset for trademark infringement detection in e-commerce, sourced directly from Alipay, one of the world's largest e-commerce and digital payment... -
CCCS-CIC-AndMal-2020
The CCCS-CIC-AndMal-2020 dataset comprises 400K android apps, to test and assess the suggested methodology. -
comma2k19 dataset
The comma2k19 dataset is used to evaluate the robustness of lane detection models under physical-world adversarial attacks in autonomous driving. -
News Recommendation Dataset
The News Recommendation Dataset is a real-world dataset used to evaluate the performance of the EENR framework. -
Chinese Event Extraction Dataset
The Chinese Event Extraction Dataset is used to train EE modular, and News Recommendation Dataset is our target recommendation dataset. -
Colosseum dataset
The Colosseum dataset is a dataset of network traffic flow level, which is used to train and test the traffic steering algorithm. -
Phi-2: A Dataset for Language Model Evaluation
The Phi-2 dataset is a collection of language models used to evaluate the performance of language models. -
MBPP: A Dataset for Language Model Evaluation
The MBPP dataset is a collection of basic programming questions used to evaluate the performance of language models. -
Traffic Light Control Dataset
The traffic light control dataset is used to evaluate the performance of reinforcement learning models in traffic light control. -
APPS: A Dataset for Code Generation Evaluation
The APPS dataset is a collection of programming problems used to evaluate the performance of code generation models.