Diverse instance discovery: Vision-Transformer for instance-aware multi-label image recognition

Multi-label image recognition is a practical and challenging computer vision task. The authors propose a method to leverage the advantages of Transformer with long-range dependency modeling to circumvent the disadvantages of CNNs limited to local receptive fields.

Data and Resources

Cite this as

Yunqing Hu, Xuan Jin, Yin Zhang, Haiwen Hong, Jingfeng Zhang, Feihu Yan, Yuan He, Hui Xue (2024). Dataset: Diverse instance discovery: Vision-Transformer for instance-aware multi-label image recognition. https://doi.org/10.57702/juzm3om4

DOI retrieved: December 2, 2024

Additional Info

Field Value
Created December 2, 2024
Last update December 2, 2024
Author Yunqing Hu
More Authors
Xuan Jin
Yin Zhang
Haiwen Hong
Jingfeng Zhang
Feihu Yan
Yuan He
Hui Xue