Compressing and Debiasing Vision-Language Pre-Trained Models for Visual Question Answering

This paper investigates whether a VLP can be compressed and debiased simultaneously by searching sparse and robust subnetworks.

Data and Resources

Cite this as

Qingyi Si, Yuanxin Liu, Zheng Lin, Peng Fu, Yanan Cao, Weiping Wang (2024). Dataset: Compressing and Debiasing Vision-Language Pre-Trained Models for Visual Question Answering. https://doi.org/10.57702/q1hf7hg0

DOI retrieved: December 3, 2024

Additional Info

Field Value
Created December 3, 2024
Last update December 3, 2024
Defined In https://doi.org/10.48550/arXiv.2210.14558
Author Qingyi Si
More Authors
Yuanxin Liu
Zheng Lin
Peng Fu
Yanan Cao
Weiping Wang