2 datasets found

Tags: Language-Image Pre-training

Filter Results
  • DEMYSTIFYING CLIP DATA

    Contrastive Language-Image Pre-training (CLIP) is an approach that has advanced research and applications in computer vision, fueling modern recognition systems and generative...
  • BLIP-2

    BLIP-2: Bootstrapping language-image pre-training with frozen image encoders and large language models
You can also access this registry using the API (see API Docs).