VATEX

The dataset used in the paper is a video question answering dataset, which is a large-scale video-language pre-training task.

BibTex: