The UCF101 dataset is used to evaluate the proposed text-to-video model. The dataset contains 101 action categories, and each category has 10 videos. The videos are labeled with text descriptions.