-
ScanObjectNN
Zero-shot learning (ZSL) aims to classify objects that are not observed or seen during training. It relies on class semantic description to transfer knowledge from the seen... -
FlyingThings3D
Dense pixel matching is required for many computer vision algorithms such as disparity, optical flow or scene flow estimation. Feature Pyramid Networks (FPN) have proven to be a... -
SemanticKitti
SemanticKitti is a large-scale street view point cloud dataset for 3D semantic segmentation. -
PointConvFormer: Revenge of the Point-based Convolution
PointConvFormer is a novel point cloud layer that combines ideas from point convolution and transformers. -
CIFAR-10 and CIFAR-100 Datasets
The CIFAR-10 and CIFAR-100 datasets are used to evaluate the performance of the commentaries curriculum. -
Oxford 102 Flowers
Oxford 102 Flowers is a dataset of images of different flower species. -
Convolutional Networks with Adaptive Inference Graphs
ConvNet-AIG is a convolutional network that adaptsively defines its network topology conditioned on the input image. -
Scalable person re-identification: A benchmark
The Market-1501 dataset is a large-scale benchmark for person re-identification. -
Tursun et al.'s dataset
Tursun et al.'s dataset is used for testing the proposed HDR-Transformer. -
Sen et al.'s dataset
Sen et al.'s dataset is used for testing the proposed HDR-Transformer. -
Kalantari et al.'s dataset
Kalantari et al.'s dataset consists of 74 samples for training and 15 samples for testing. -
Subject2Vec: Generative-Discriminative Approach from a Set of Image Patches t...
A deep learning model that learns subject-level representation from a set of local features. The model represents the image volume as a bag (or set) of local features and can... -
Sintel Dataset
The dataset used in the paper is a Sintel dataset, which consists of low-resolution optical flow maps and their corresponding high-resolution RGB images. -
Structural Vision Transformer
Structural Vision Transformer (StructViT) is a vision transformer network that leverages structural self-attention (StructSA) to capture correlation structures in images and... -
Waste Classification using Computer Vision and Deep Learning
Dataset for waste classification using computer vision and deep learning