Compressing and Debiasing Vision-Language Pre-Trained Models for Visual Question Answering

This paper investigates whether a VLP can be compressed and debiased simultaneously by searching sparse and robust subnetworks.

BibTex: