When and why Vision-Language Models behave like Bags-of-Words, and what to do about it?

Followers: 0

Organization

No Organization

There is no description for this organization

License

No License Provided

Export

DCAT(rdf/xml) DCAT(xml) DCAT(N3) DCAT(ttl) DCAT(jsonld) DataCite CSL DublinCore BibTex

When and why Vision-Language Models behave like Bags-of-Words, and what to do about it?

When and why Vision-Language Models behave like Bags-of-Words, and what to do about it?

BibTex:

Before browse our site, please accept our cookies policy