Adapting pre-trained visual language models in the low-data regime

The dataset used in the paper for task adaptation in the low-data regime, including COCO, Localized Narratives, ImageNet, and VQAv2.

BibTex: