TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment

doi:doi:10.57702/j3ydf3ns

TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment

Followers: 0

Organization

No Organization

There is no description for this organization

License

No License Provided

Export

DCAT(rdf/xml) DCAT(xml) DCAT(N3) DCAT(ttl) DCAT(jsonld) DataCite CSL DublinCore BibTex

TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment

TOPA is a text-only pre-alignment framework for extending large language models for video understanding without the need for pre-training on real video data.

BibTex:

Before browse our site, please accept our cookies policy