Next-QA

A video question answering dataset that focuses on visually grounded question answering.

BibTeX: