Query-guided Attention in Vision Transformers for Localizing Objects Using a Single Sketch

Sketch-based object localization in natural images, where given a crude hand-drawn sketch of an object, the goal is to localize all the instances of the same object on the target image.

Data and Resources

Cite this as

Aditay Tripathi, Anand Mishra, Anirban Chakraborty (2024). Dataset: Query-guided Attention in Vision Transformers for Localizing Objects Using a Single Sketch. https://doi.org/10.57702/tx3duntn

DOI retrieved: December 2, 2024

Additional Info

Field Value
Created December 2, 2024
Last update December 2, 2024
Author Aditay Tripathi
More Authors
Anand Mishra
Anirban Chakraborty