Openclip

Openclip: A large-scale multimodal dataset for vision and language understanding.

BibTeX: