1 dataset found

Tags: visio-linguistic compositionality

Filter Results
  • Winoground

    The Winoground dataset consists of 400 items, each containing two image-caption pairs (I0, C0), (I1, C1).
You can also access this registry using the API (see API Docs).