WeakTr: Exploring Plain Vision Transformer for Weakly-supervised Semantic Segmentation

Weakly-supervised semantic segmentation using plain Vision Transformer (ViT) for Weakly-supervised Semantic Segmentation (WSSS).

BibTex: