-
RAMS-Trans: Recurrent Attention Multi-scale Transformer for Fine-grained Imag...
Fine-grained image recognition (FGIR) has been a challenging problem. Most of the current methods are dominated by convolutional neural networks (CNNs). FGIR has the problem of... -
ViT-FOD: A Vision Transformer based Fine-grained Object Discriminator
Fine-grained object discrimination using Vision Transformer