Microsoft Research Video Description Corpus (MSVD)

The MSVD dataset is a collection of 1970 open domain clips from YouTube, annotated with variable-length captions.

BibTex: