-
MobileDepth: Efficient Monocular Depth Prediction on Mobile Devices
Depth prediction is fundamental for many useful applications on computer vision and robotic systems. On mobile phones, the performance of some useful applications as augmented... -
CIFAR10, CIFAR100, ImageNet
MobileNets, MnasNets, EfficientNets, and ResNets -
Residual Networks
Residual Networks (ResNet) is composed of stacked entities referred to as residual blocks. A Residual Block of ResNet contains a module and an identity loop. -
Real-world Vehicle Point Cloud
The dataset used in this paper is a real-world vehicle point cloud collected from a real vehicle self-driving process. -
PoseAction: Action Recognition for Patients in the Ward using Deep Learning A...
Real-time intelligent detection and prediction of subjects' behavior particularly their movements or actions is critical in the ward. -
DepthP+P: Metric Accurate Monocular Depth Estimation using Planar and Parallax
DepthP+P: A method for self-supervised monocular depth estimation using planar and parallax. -
3D Point Clouds
The dataset used in this paper is a collection of 3D point clouds. -
OBoW: Online Bag-of-Visual-Words Generation for Self-Supervised Learning
The dataset used in the paper is not explicitly described, but it is mentioned that the authors used the ImageNet, Places205, and VOC07 datasets for evaluation. -
Trypophobia dataset
Dataset used for training and testing Convolutional Neural Networks for detecting trypophobia triggers. -
COCO Keypoint Benchmark
The COCO keypoint benchmark is a widely used dataset for human pose estimation. -
Context-and-Spatial Aware Network for Multi-Person Pose Estimation
Multi-person pose estimation is a fundamental yet challenging task in computer vision. Both rich context information and spatial information are required to precisely locate the... -
Faces Dataset
The dataset used in the paper for testing the GaMeS model, containing images of six people, generated using Blender software from various perspectives, excluding the backs of... -
Mip-NeRF360 dataset
The dataset used in the paper for testing the GaMeS model, containing 5 outdoor and 4 indoor scenes, each featuring intricate central objects or areas against detailed backgrounds. -
LSUN Bedroom and LSUN Cat dataset
The LSUN Bedroom and LSUN Cat dataset is a large-scale image dataset used for training and testing the proposed approach. -
Exploring Advances in Transformers and CNN for Skin Lesion Diagnosis on Small...
Skin cancer is one of the most common types of cancer in the world. Different computer-aided diagnosis systems have been proposed to tackle skin lesion diagnosis, most of them... -
Vision Big Bird
Vision Big Bird: Random Sparsification for Full Attention