Slot-VLM: SlowFast Slots for Video-Language Modeling

doi:doi:10.57702/ylnbizgr

Slot-VLM: SlowFast Slots for Video-Language Modeling

Followers: 0

Organization

No Organization

There is no description for this organization

License

No License Provided

Export

DCAT(rdf/xml) DCAT(xml) DCAT(N3) DCAT(ttl) DCAT(jsonld) DataCite CSL DublinCore BibTex

Slot-VLM: SlowFast Slots for Video-Language Modeling

Video-Language Models (VLMs), powered by the advancements in Large Language Models (LLMs), are charting new frontiers in video understanding. A pivotal challenge is the development of an efficient method to encapsulate video content into a set of representative tokens to align with LLMs.

BibTex:

Before browse our site, please accept our cookies policy