Dataset - LDM

DEMYSTIFYING CLIP DATA

Contrastive Language-Image Pre-training (CLIP) is an approach that has advanced research and applications in computer vision, fueling modern recognition systems and generative...
- Dataset
- JSON

BLIP-2

BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

2 datasets found