-
Deepfake Video Detection Using Generative Convolutional Vision Transformer
Deepfake video detection using Generative Convolutional Vision Transformer (GenConViT) for deepfake video detection. Our model combines ConvNeXt and Swin Transformer models for... -
S3T: Self-supervised pre-training with Swin Transformer for music classification
Self-supervised pre-training method with Swin Transformer for music classification, leveraging massive unlabeled music data to improve the performance of music classification... -
Vehicle Logo Recognition
Vehicle logo recognition using Swin Transformer -
WITT: A Wireless Image Transmission Transformer for Semantic Communications
The proposed WITT scheme is designed for wireless image transmission, and it uses the Swin Transformer as a backbone to extract long-range information.