ViT-FOD: A Vision Transformer based Fine-grained Object Discriminator

Fine-grained object discrimination using a Vision Transformer.

Cite this as

Zhen-Duo Chen, Zi-Chao Zhang, Yongxin Wang, Xin Luo, Xin-Shun Xu (2024). Dataset: ViT-FOD: A Vision Transformer based Fine-grained Object Discriminator. https://doi.org/10.57702/t4l2hu7j

DOI retrieved: December 16, 2024

Additional Info

Field: Value
Created: December 16, 2024
Last update: December 16, 2024
Author: Zhen-Duo Chen
More Authors: Zi-Chao Zhang, Yongxin Wang, Xin Luo, Xin-Shun Xu