-
Animal Kingdom
A large and diverse dataset for animal behavior understanding. -
Hierarchical 3D fully convolutional networks for multi-organ segmentation
A two-stage, coarse-to-fine approach that trains an FCN model to roughly delineate the organs of interest in the first stage and then uses these predictions of the first-stage... -
X-volution: On the Unification of Convolution and Self-attention
Convolution and self-attention are acting as two fundamental building blocks in deep neural networks, where the former extracts local image features in a linear way while the... -
ConvMLP: Hierarchical Convolutional MLPs for Vision
ConvMLP: a Hierarchical Convolutional MLP backbone for visual recognition -
MNIST and FashionMNIST
The MNIST and FashionMNIST datasets are used to test the performance of the proposed generative autoencoders. -
FMoW-WILDS
Functional map of the world. -
LiqD: A Dynamic Liquid Level Detection Model
A container dynamic liquid level detection model based on U²-Net. -
WeakTr: Exploring Plain Vision Transformer for Weakly-supervised Semantic Seg...
Weakly-supervised semantic segmentation using plain Vision Transformer (ViT) for Weakly-supervised Semantic Segmentation (WSSS). -
Cloud-removal-from-solar-imagery
The dataset used for removing cloud shadows from ground-based solar imagery. -
Classification of Diabetic Retinopathy using Pre-Trained Deep Learning Models
Diabetic Retinopathy dataset containing 1000 color fundus images from KAGGLE -
A Deep Neural Network for Multiclass Bridge Element Parsing in Inspection Ima...
Aerial robots such as drones have been leveraged to perform bridge inspections. Inspection images with both recognizable structural elements and apparent surface defects can be... -
Object Tracking Benchmark
The OTB100 dataset is an extension of the OTB50 dataset, containing 100 videos with 1000 frames each. -
VIGOR dataset
The VIGOR dataset contains images of buildings and streets from different angles and lighting conditions. -
Stanford-Cars, Oxford-Flowers102, Oxford-IIIT Pets, FGVC Aircraft, CIFAR-10 d...
Stanford-Cars, Oxford-Flowers102, Oxford-IIIT Pets, FGVC Aircraft, CIFAR-10 datasets. -
Tied-Augment: Controlling Representation Similarity Improves Data Augmentation
Data augmentation methods have played an important role in the recent advance of deep learning models, and have become an indispensable component of state-of-the-art models in... -
MnasNet: Platform-Aware Neural Architecture Search for Mobile
Designing convolutional neural networks (CNN) for mobile devices is challenging because mobile models need to be small and fast, yet still accurate. Although significant efforts... -
ARS: Augmented Reality Semi-automatic-labeling
Two novel datasets are created using the ARS pipeline, one on electromechanical components (industrial scenario) and one on fruits (daily-living scenario). -
Transform Quantization for CNN Compression
The dataset used in this paper is a collection of convolutional neural network (CNN) weights, which are compressed using transform quantization. -
Engineering the Neural Collapse Geometry of Supervised-Contrastive Loss
Supervised-contrastive loss (SCL) is an alternative to cross-entropy (CE) for classification tasks that makes use of similarities in the embedding space to allow for richer...