Vision Transformers - Groups

SMMix

SMMix is a novel image mixing method that motivates both image and label enhancement by the model under training itself.

Dataset
JSON

SMMix: Self-Motivated Image Mixing for Vision Transformers

CutMix is a vital augmentation strategy that determines the performance and generalization ability of vision transformers (ViTs). However, the inconsistency between the mixed...

Dataset
JSON

Vision Transformers for Dense Prediction

A dataset for vision transformers

Dataset
JSON

Query-guided Attention in Vision Transformers for Localizing Objects Using a ...

Sketch-based object localization in natural images, where given a crude hand-drawn sketch of an object, the goal is to localize all the instances of the same object on the...

Dataset
JSON

4 datasets found

SMMix

SMMix: Self-Motivated Image Mixing for Vision Transformers

Vision Transformers for Dense Prediction

Query-guided Attention in Vision Transformers for Localizing Objects Using a ...