Ask-Anything

A video-centric multimodal instruction dataset, composed of thousands of videos associated with detailed descriptions and conversations.

BibTex: