You're currently viewing an old version of this dataset. To see the current version, click here.

RAMS-Trans: Recurrent Attention Multi-scale Transformer for Fine-grained Image Recognition

Fine-grained image recognition (FGIR) has been a challenging problem. Most of the current methods are dominated by convolutional neural networks (CNNs). FGIR has the problem of large intra-class variance and small inter-class variance. Therefore, FGIR methods need to be able to identify and localize region attention in an image that is critical for classification.

Data and Resources

Cite this as

Yunqing Hu, Xuan Jin, Yin Zhang, Haiwen Hong, Jingfeng Zhang, Yuan He, Hui Xue (2024). Dataset: RAMS-Trans: Recurrent Attention Multi-scale Transformer for Fine-grained Image Recognition. https://doi.org/10.57702/hm8kfsf9

DOI retrieved: December 16, 2024

Additional Info

Field Value
Created December 16, 2024
Last update December 16, 2024
Author Yunqing Hu
More Authors
Xuan Jin
Yin Zhang
Haiwen Hong
Jingfeng Zhang
Yuan He
Hui Xue
Homepage https://doi.org/10.1145/1122445.1122456