Vision-by-Language for Training-Free Compositional Image Retrieval

doi:doi:10.57702/oz7xwa3m

Vision-by-Language for Training-Free Compositional Image Retrieval

Followers: 0

Organization

No Organization

There is no description for this organization

License

No License Provided

Export

DCAT(rdf/xml) DCAT(xml) DCAT(N3) DCAT(ttl) DCAT(jsonld) DataCite CSL DublinCore BibTex

Vision-by-Language for Training-Free Compositional Image Retrieval

Compositional Image Retrieval through Vision-by-Language (CIReVL) is a training-free approach for Zero-Shot Compositional Image Retrieval (CIR). Utilizing off-the-shelf pre-trained models, CIReVL achieves strong performance across multiple CIR benchmarks.

BibTex:

Before browse our site, please accept our cookies policy