-
ActivityNet-QA
Video question answering (VideoQA) is an essential task in vision-language understanding, which has attracted numerous research attention recently. Nevertheless, existing works... -
KnowIT VQA
A video story question answering dataset containing 24,282 questions about 207 episodes of The Big Bang Theory. -
Youtube2Text-QA
Video question answering task, which requires machines to answer questions about videos in a natural language form.