-
CIFAR-10 and Fashion-MNIST datasets
The authors used the CIFAR-10 and Fashion-MNIST datasets for semi-supervised federated learning-based UAV image recognition tasks. -
Animals with Attributes (AwA) dataset
The Animals with Attributes (AwA) dataset consists of 50 classes of animals, with 30,475 images in total. -
Binary Image Database
A set of binary images of 9 classes, including bird, camel, children, elephant, fork, hammer, key, ray and turtle. -
DollarStreet
DollarStreet dataset includes 38k images of household objects across homes from 54 countries and with different income levels. -
Very Deep Convolutional Networks for Large-Scale Image Recognition
The dataset consists of 60,000 images of objects in 200 categories, with 300 images per category. -
Multi-digit number recognition from street view imagery using deep convolutio...
A dataset for CAPTCHA recognition -
SVHN Network
A dataset for CAPTCHA recognition -
ImageNet2012
The dataset used in the paper for attention-oriented data analysis and attention-based adversarial defense. -
ImageNet Large Scale Visual Recognition Challenge (ILSVRC)
The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) dataset is a large-scale image classification dataset containing over 14 million images from 21,841 categories. -
LSUN Bedrooms
The dataset used in the paper is the LSUN bedrooms dataset, a large-scale image dataset. -
IMAGENET ILSVRC2012
The dataset used for image recognition using deep convolutional neural networks. -
Dogs vs Cats
The dataset used for image recognition using deep convolutional neural networks. -
Conformer: Local Features Coupling Global Representations
Conformer is a dual network structure that combines CNN-based local features with transformer-based global representations for enhanced representation learning. -
PASCAL VOC Dataset
The PASCAL VOC dataset contains 20 classes, including person, animal, vehicle, and indoor, with 9,963 images containing 24,640 annotated objects. -
MIT Indoor Scene Recognition
The MIT Indoor Scene Recognition dataset contains 67 categories of indoor scenes. -
VGG Network E
The dataset used in this paper is the VGG Network E, a deep convolutional neural network for image recognition. -
Sprites dataset
The dataset consists of binary images of sprites with variations in the shape (oval, square, and heart) and four geometric factors: scale (6 variation modes), rotation (40), and... -
Deep Image: Scaling up image recognition
Deep Image: Scaling up image recognition -
Holy Places Dataset
A dataset of images of holy places (Kaaba, Zamzam, Maqam Ibrahim) for training a deep learning model.