You're currently viewing an old version of this dataset. To see the current version, click here.
Original Metadata
The json representation of the dataset with its distributions based on DCAT.
Cite this as
Hang Zhang, Xin Li, Lidong Bing (2024). Dataset: Video-LLaMA: An instruction-tuned audio-visual language model for video understanding. Resource: Original Metadata. https://doi.org/10.57702/ztz8frfm
DOI retrieved: December 3, 2024
Additional Information
Field | Value |
---|---|
Created | unknown |
Last updated | December 3, 2024 |
Format | application/json |