-
KITTI Benchmark Dataset
The KITTI benchmark dataset is used to evaluate the performance of the proposed method. The dataset contains large-scale outdoor sequences of images captured by a forward-facing...
-
Query-guided Attention in Vision Transformers for Localizing Objects Using a ...
Sketch-based object localization in natural images, where given a crude hand-drawn sketch of an object, the goal is to localize all the instances of the same object on the...
-
PASCAL VOC 2007
Multi-label image recognition is a more practical and challenging task than single-label image classification.