-
PhishStorm phishing/legitimate URL dataset
PhishStorm phishing/legitimate URL dataset -
Malicious URLs dataset
Malicious URLs dataset -
SMS phishing dataset
SMS phishing dataset for machine learning and pattern recognition -
Birds-to-Words
The Birds-to-Words dataset contains 15,931 images (12,770 training and 3,151 testing) tagged with descriptions of fine-grained differences between pairwise bird images. -
OmniObject3D
OmniObject3D is a real-scanned 3D object dataset with 6,000 samples. For efficiency, we randomly select 100 objects for evaluation. -
Conceptual Captions
The dataset used in the paper "Scaling Laws of Synthetic Images for Model Training". The dataset is used for supervised image classification and zero-shot classification tasks. -
COCO 2017 Detection Dataset
A large dataset for object detection, containing 118k training images and 5k validation images. -
VQA: Visual Question Answering
Visual Question Answering (VQA) has emerged as a prominent multi-discipline research problem in both academia and industry. -
REDD dataset
The REDD dataset is a dataset for energy disaggregation. It contains about half a month of power consumption data from real homes in the US, for the whole house as well as for each individual... -
The UK-DALE dataset
The UK-DALE dataset contains measurements of aggregate and appliance power consumption in five UK homes. -
Benchmark Fair Classification Dataset
The dataset used in the paper on fair subgroup mixup for improving group fairness. -
Law School Admission Bar Passage
The dataset used in the paper on fair subgroup mixup for improving group fairness. -
Sintel Dataset
The dataset used in the paper is the Sintel dataset, which consists of low-resolution optical flow maps and their corresponding high-resolution RGB images. -
Caltech101
The dataset used in the paper is Caltech101, which is a natural image classification dataset. It contains 101 categories of natural images.