1 dataset found

Tags: multiple actions

  • ActivityNet-QA

    Video question answering (VideoQA) is an essential task in vision-language understanding, which has attracted considerable research attention recently. Nevertheless, existing works...
You can also access this registry using the API (see API Docs).