FocusCLIP: Multimodal Subject-Level Guidance for Zero-Shot Transfer in Human-Centric Tasks. This paper introduces FocusCLIP, an enhancement for CLIP pretraining using a new ROI...