Deepfake Video Detection Using Generative Convolutional Vision Transformer

Deepfake video detection using Generative Convolutional Vision Transformer (GenConViT) for deepfake video detection. Our model combines ConvNeXt and Swin Transformer models for feature extraction, and it utilizes Autoencoder and Variational Autoencoder to learn from the latent data distribution.

BibTex: