TGIF-QA

The TGIF-QA dataset consists of 165165 QA pairs chosen from 71741 animated GIFs. To evaluate the spatiotemporal reasoning ability at the video level, TGIF-QA dataset designs four unique task types, i.e., repetition count, repeating action, state transition and frame QA.

Data and Resources

Cite this as

Deng Huang, Peihao Chen, Runhao Zeng, Qing Du, Mingkui Tan, Chuang Gan (2024). Dataset: TGIF-QA. https://doi.org/10.57702/exmq0j83

DOI retrieved: December 3, 2024

Additional Info

Field Value
Created December 3, 2024
Last update December 3, 2024
Defined In https://doi.org/10.48550/arXiv.2406.18538
Citation
  • https://doi.org/10.48550/arXiv.1907.03049
  • https://doi.org/10.48550/arXiv.2210.03941
  • https://doi.org/10.48550/arXiv.2302.02136
  • https://doi.org/10.48550/arXiv.2008.09105
  • https://doi.org/10.48550/arXiv.2105.08276
Author Deng Huang
More Authors
Peihao Chen
Runhao Zeng
Qing Du
Mingkui Tan
Chuang Gan
Homepage https://github.com/SunDoge/L-GCN