Compressing and Debiasing Vision-Language Pre-Trained Models for Visual Question Answering

doi:10.57702/q1hf7hg0

This paper investigates whether a vision-language pre-trained model (VLP) can be compressed and debiased simultaneously by searching for sparse and robust subnetworks.

Data and Resources

Original Metadata (JSON)
The JSON representation of the dataset with its distributions based on DCAT.

Cite this as

Qingyi Si, Yuanxin Liu, Zheng Lin, Peng Fu, Yanan Cao, Weiping Wang (2024). Dataset: Compressing and Debiasing Vision-Language Pre-Trained Models for Visual Question Answering. https://doi.org/10.57702/q1hf7hg0

DOI retrieved: December 3, 2024

Additional Info

Field	Value
Created	December 3, 2024
Last update	December 3, 2024
Defined In	https://doi.org/10.48550/arXiv.2210.14558
Author	Qingyi Si
More Authors	Yuanxin Liu, Zheng Lin, Peng Fu, Yanan Cao, Weiping Wang