BLIP-2

BLIP-2: Bootstrapping language-image pre-training with frozen image encoders and large language models

BibTex: