- Video Instruction Data
  A video-centric instruction dataset, composed of 7K detailed video descriptions and 4K video conversations.
- Ask-Anything
  A video-centric multimodal instruction dataset, composed of thousands of videos associated with detailed descriptions and conversations.