Dataset - LDM

Scenenet RGB-D: 5M photorealistic images of synthetic indoor trajectories wit...

Scenenet RGB-D: 5M photorealistic images of synthetic indoor trajectories with ground truth
- Dataset
- JSON
MVSNeRF: Fast generalizable radiance field reconstruction from multi-view stereo

MVSNeRF: Fast generalizable radiance field reconstruction from multi-view stereo
- Dataset
- JSON
Multi-task view synthesis with Neural Radiance Fields

Multi-task view synthesis with neural radiance fields
- Dataset
- JSON
Tied Block Convolution: Leaner and Better CNNs with Shared Thinner Filters

Convolution is the main building block of convolutional neural networks (CNN). We observe that an optimized CNN often has highly correlated filters as the number of channels...
- Dataset
- JSON
AWA2 (Animals with Attributes 2)

The AWA2 dataset includes 50 classes of assorted animals totaling 37,322 samples, of which 10 categories are considered unseen classes. Attribute annotations are 85-dimensional.
- Dataset
- JSON
SUN (SUN Attribute)

The SUN dataset has a sample of 717 different scenes totaling 14,340 images, where 72 categories are unseen classes. Attribute annotations are 102-dimensional.
- Dataset
- JSON
CUB (Caltech UCSD Birds 200)

The CUB dataset comprises 200 bird species totaling 11,788 image samples, of which 50 categories are planned as unseen classes. The SUN dataset has a sample of 717 different...
- Dataset
- JSON
Deep Geometric Moment (DGM) Model

The proposed model consists of three components: 1) Coordinate base computation: uses a 2D coordinate grid as input and generates the bases, 2) Image feature computation:...
- Dataset
- JSON
Improving Shape Awareness and Interpretability in Deep Networks Using Geometr...

Deep networks for image classification often rely more on texture information than object shape. This paper presents a deep-learning model inspired by geometric moments, a...
- Dataset
- JSON
Osteoarthritis Initiative (OAI) dataset

Knee OsteoArthritis (KOA) dataset used for early detection of KOA (KL-0 vs KL-2) using Vision Transformer (ViT) model with selective shuffled position embedding and key-patch...
- Dataset
- JSON
S3DIS and ShapeNetPart

The dataset used for indoor scene segmentation and object part segmentation.
- Dataset
- JSON
STL-10 dataset

The dataset used in this paper is a collection of images from the STL-10 dataset, preprocessed and used for training and evaluation of the proposed diffusion spectral entropy...
- Dataset
- JSON
Omni-Line-of-Sight Imaging for Holistic Shape Reconstruction

We introduce Omni-LOS, a neural computational imaging method for conducting holistic shape reconstruction (HSR) of complex objects utilizing a Single-Photon Avalanche Diode...
- Dataset
- JSON
Graph-Regularized Attentive Convolutional Entanglement for Robust DeepFake Vi...

The proposed GRACE method leverages feature entanglement with sparse constraints and a graph convolutional network with graph Laplacian smoothing prior regularization to...
- Dataset
- JSON
Real-World Depth of Field Dataset

The dataset consists of real captured scenes with varying exposures, apertures, and focus distances.
- Dataset
- JSON
Cinematic Gaussians: Real-Time HDR Radiance Fields with Depth of Field

The dataset consists of four synthetic rendered scenes and four real captured scenes. The synthetic scenes provide control and ground truth for evaluation, while the real...
- Dataset
- JSON
Thumbnail Generation Dataset

Thumbnail generation dataset used in the paper for training and testing the proposed model.
- Dataset
- JSON
DRIVE

The DRIVE dataset contains the curvilinear-shaped vessel. This dataset consists of 40 565 × 584 color retinal images, which are split into 20 training images and 20 test images.
- Dataset
- JSON
ThreshNet: An Efficient DenseNet Using Threshold Mechanism to Reduce Connections

The proposed network architecture uses a threshold mechanism to further optimize the connection method, reducing connections between layers to accelerate inference time.
- Dataset
- JSON
Semantic Equitable Clustering: A Simple, Fast and Effective Strategy for Visi...

The Vision Transformer (ViT) has gained prominence for its superior relational modeling prowess. However, its global attention mechanism’s quadratic complexity poses substantial...
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

992 datasets found