227 datasets found

Tags: Dataset

Filter Results
  • Phishstorm phishing/legitimate url dataset

    Phishstorm phishing/legitimate url dataset
  • Malicious urls dataset

    Malicious urls dataset
  • Sms phishing dataset

    Sms phishing dataset for machine learning and pattern recognition
  • Birds-to-Words

    The Birds-to-Words dataset contains 15,931 images (12,770 training and 3,151 testing) tagged with descriptions of fine-grained differences between pairwise bird images.
  • CIRR

    CIRR is a general image dataset that comprises 36,554 triplets derived from 21,552 images from the popular natural language inference dataset NLVR2.
  • FashionIQ

    The FashionIQ dataset contains images of fashion products over 3 categories: Dress, Toptee, and Shirt, with 46,609 images in the training and 31,075 images in the validation set.
  • OmniObject3D

    OmniObject3D is a real-scanned 3D object dataset with 6000 samples. For efficiency, we randomly select 100 objects for evaluation.
  • Conceptual Captions

    The dataset used in the paper "Scaling Laws of Synthetic Images for Model Training". The dataset is used for supervised image classification and zero-shot classification tasks.
  • COCO 2017 Detection Dataset

    A large dataset for object detection, containing 118k training images and 5k validation images.
  • NUS-WIDE

    The dataset used in the paper is a multi-view clustering dataset, which contains 6 views of 30000 samples each. The dataset is used to evaluate the performance of the proposed...
  • VQA: Visual Question Answering

    Visual Question Answering (VQA) has emerged as a prominent multi-discipline research problem in both academia and industry.
  • REDD dataset

    The REDD dataset is a dataset for energy disaggregation. It contains about half month power consumption from real homes in US, for the whole house as well as for each individual...
  • The UK-DALE dataset

    The UK-DALE dataset contains measurements of aggregate and appliance power consumption in five UK homes.
  • Shapenet

    Shapenet is a large-scale synthesis 3D object dataset, where we follow [9] to use the official test splits of chair, car, and motorbike categories for evaluation since they...
  • Benchmark Fair Classification Dataset

    The dataset used in the paper for fair subgroup mixup for improving group fairness.
  • Law School Admission Bar Passage

    The dataset used in the paper for fair subgroup mixup for improving group fairness.
  • Flickr30k

    The Flickr30k dataset is widely utilized for image caption and image-text retrieval tasks, providing a substantial collection of images with associated captions.
  • Dataset

    The dataset used in this paper consists of command speech, conversation speech, emotion and speaking style-specific speech, Korean speakers' foreign language speech, and Korean...
  • Sintel Dataset

    The dataset used in the paper is a Sintel dataset, which consists of low-resolution optical flow maps and their corresponding high-resolution RGB images.
  • Caltech101

    The dataset used in the paper is Caltech101, which is a natural image classification dataset. It contains 101 categories of natural images.
You can also access this registry using the API (see API Docs).