WebVid-10M: A large-scale video dataset for text-to-video generation

BibTeX: