-
YouTube-8M: A Large-Scale Video Classification Benchmark
YouTube-8M is a large-scale video classification benchmark. -
OpenImages
Large-scale vision-and-language models trained on curated and web-scrapped data have led to significant improvements over task-specific models when transferred to downstream... -
Microsoft COCO
The Microsoft COCO dataset was used for training and evaluating the CNNs because it has become a standard benchmark for testing algorithms aimed at scene understanding and...