6 datasets found

Formats: JSON Tags: video description

Filter Results
  • YouCook

    A dataset of cooking videos with multiple sentence descriptions.
  • Movie Description dataset

    A novel dataset of movies with aligned descriptions sourced from movie scripts and DVS (Descriptive Video Service) audio descriptions.
  • TACoS

    A dataset of videos with multiple sentence descriptions, used for activity recognition and video description tasks.
  • HowTo100M

    The dataset used in the LORD framework for autonomous driving, consisting of images, videos, and text-based observations.
  • MSVD

    Text-Video Retrieval (TVR) aims to align relevant video content with natural language queries. To date, most state-of-the-art TVR methods learn image-to-video transfer learning...
  • MSR-VTT

    The dataset used in the paper is MSR-VTT, a large video description dataset for bridging video and language. The dataset contains 10k video clips with length varying from 10 to...
You can also access this registry using the API (see API Docs).