Valley: A Video Assistant with Large Language Model Enhanced Ability

doi:doi:10.57702/5o0eqfc5

Valley: A Video Assistant with Large Language Model Enhanced Ability

Followers: 0

Organization

No Organization

There is no description for this organization

License

No License Provided

Export

DCAT(rdf/xml) DCAT(xml) DCAT(N3) DCAT(ttl) DCAT(jsonld) DataCite CSL DublinCore BibTex

Valley: A Video Assistant with Large Language Model Enhanced Ability

A large multi-modal instruction-following dataset for video understanding, comprising 37k conversation pairs, 26k complex reasoning QA pairs and 10k detail description instruction pairs.

BibTex:

Before browse our site, please accept our cookies policy