Degenerate Swin to Win: Plain Window-based Transformer without Sophisticated Operations
The proposed Win Transformer achieves consistently superior performance than Swin Transformer on multiple computer vision tasks, including image recognition, semantic segmentation, and object detection.
BibTex: