Dataset Groups Activity Stream Groups Captioning View Captioning Cross-Modal Learning View Cross-Modal Learning Multimodal Learning View Multimodal Learning Speech Recognition View Speech Recognition Text-Based Video Retrieval View Text-Based Video Retrieval Text-Video Retrieval View Text-Video Retrieval Video Analysis View Video Analysis Video Captioning View Video Captioning Video Classification View Video Classification Video Description View Video Description Video Generation View Video Generation Video Question Answering View Video Question Answering Video Retrieval View Video Retrieval Video Understanding View Video Understanding Video-Text Retrieval View Video-Text Retrieval