-
Confluence: A Robust Non-IOU Alternative to Non-Maxima Suppression in Object ...
Confluence is a novel non-Intersection over Union (IoU) alternative to Non-Maxima Suppression (NMS) in bounding box post-processing in object detection. -
The Unmanned Aerial Vehicle Benchmark: Object Detection and Tracking
-
LVIS: A dataset for open-vocabulary object detection
A dataset for open-vocabulary object detection. -
R-CNN minus R
Deep convolutional neural networks (CNNs) have had a major impact in most areas of image understanding, including object category detection. -
SSDD dataset
The SSDD dataset contains 1160 images in total, with resolutions ranging from 1 m to 15 m; the training set includes 928 images and the test set the remaining 232. -
DOTA dataset
The DOTA dataset is a large-scale dataset for object detection in aerial images. It includes 2806 aerial images, each ranging from 800 × 800 to 4000 × 4000 pixels, collected... -
COCO Test-Dev Set
The COCO test-dev set is a subset of the COCO detection dataset. -
COCO Validation Set
The COCO validation set is a subset of the COCO detection dataset. -
COCO Detection Dataset
The COCO detection dataset is a large-scale object detection benchmark. -
YOLOv3 Dataset
The YOLOv3 dataset is a dataset for object detection and classification. -
Improving Learning Effectiveness For Object Detection and Classification in Cl...
The proposed framework generates a training dataset in heterogeneous cluttered backgrounds for object detection and classification. -
Improvised Aerial Object Detection approach for YOLOv3 Using Weighted Luminance
Aerial imaging of ground targets is highly challenging because of various factors that affect light propagation through different mediums. Several convolutional neural... -
OpenImages dataset
The paper does not describe its dataset explicitly, but it mentions that the authors trained their models on the OpenImages dataset. -
KITTI 2012
KITTI 2012 is a real-world outdoor dataset containing 194 training and 195 testing stereo image pairs of size 376 × 1240. -
Query-guided Attention in Vision Transformers for Localizing Objects Using a ...
Sketch-based object localization in natural images, where given a crude hand-drawn sketch of an object, the goal is to localize all the instances of the same object on the...