Webvid10M

The dataset used for training the image-to-video model consists of LAION COCO 600M and Webvid10M.

BibTex: