1 dataset found

Formats: JSON Tags: Vision and Language Understanding

Filter Results
  • Openclip

    Openclip: A large-scale multimodal dataset for vision and language understanding.
You can also access this registry using the API (see API Docs).