MV-VTON: Multi-View Virtual Try-On with Diffusion Models

A proposed method for the Multi-View Virtual Try-On (MV-VTON) task, which aims to use frontal and back clothing to reconstruct the dressing results of a person from multiple...

Dataset
JSON

Dynamic Approach for Lane Detection using Google Street View and CNN

A dataset of 2000 RGB images for lane detection using SegNet architecture.

Dataset
JSON

NeRFBuster

A real-world dataset captured by mobile phones and containing quite complex trajectories.

Dataset
JSON

CF-NeRF

A novel end-to-end method that does not require prior camera parameters to deal with image sequences with complex trajectories.

Dataset
JSON

SPair-71k

The proposed method, dubbed Dynamic Hyperpixel Flow, learns to compose hypercolumn features on the fly by selecting a small number of relevant layers from a deep convolutional...

Dataset
JSON

Oxford RobotCar Dataset

The Oxford RobotCar Dataset is a collection of images and videos of a car driving on various roads and conditions.

Dataset
JSON

MNIST, CIFAR-10, CIFAR-100, Tiny-ImageNet, VGG-like

The datasets used in the paper are MNIST, CIFAR-10, CIFAR-100, Tiny-ImageNet, and VGG-like.

Dataset
JSON

7-Scenes

This paper proposes the use of Neural Radiance Fields (NeRF) as a scene representation for visual localization.

Dataset
JSON

Cambridge Landmarks

The Cambridge Landmarks dataset contains 5 different large outdoor scenes of landmarks in the city of Cambridge.

Dataset
JSON

Sparse Resnet50 model

The dataset used in this paper is a sparse Resnet50 model, which is a variant of the Resnet50 model with 80% sparsity.

Dataset
JSON

Two-level Group Convolution

The proposed two-level group convolution is suitable for distributed memory computing and robust to a large number of groups.

Dataset
JSON

Degenerate Swin to Win: Plain Window-based Transformer without Sophisticated ...

The proposed Win Transformer consistently outperforms Swin Transformer on multiple computer vision tasks, including image recognition, semantic...

Dataset
JSON

ResNet-50

The dataset used in the paper is the ResNet-50 dataset, a convolutional neural network model.

Dataset
JSON

ANTNets: Mobile Convolutional Neural Networks for Resource Efficient Image Cla...

Deep convolutional neural networks have achieved remarkable success in computer vision. However, deep neural networks require large computing resources to achieve high...

Dataset
JSON

Traffic Signs dataset

The Traffic Signs dataset contains 39252 training images in 43 classes.

Dataset
JSON

Pose-Aware Video Transformers

Human perception of surroundings is often guided by the various poses present within the environment. Many computer vision tasks, such as human action recognition and robot...

Dataset
JSON

Cap3D dataset

The Cap3D dataset is a large-scale dataset of 3D models with captions.

Dataset
JSON

Objaverse-LVIS dataset

The Objaverse-LVIS dataset contains approximately 46,000 3D models in 1,156 categories.

Dataset
JSON

ImageNet-1000

The paper uses CNNs pre-trained on ImageNet-1000.

Dataset
JSON

Attentive Normalization

The proposed Attentive Normalization (AN) aims to harness the best of feature normalization and feature attention in a single lightweight module.

Dataset
JSON

1,082 datasets found