-
BigEarthNet-MM
A large-scale benchmark archive for remote sensing image classification and retrieval. -
Wind Farm Dataset
The dataset is used to test the HCMAPPO algorithm for large-scale wind farm control. It includes 13, 16, 19, and 22 wind turbines with their coordinates, wind speeds, and... -
Nordland Railway dataset
The Nordland Railway dataset is a large-scale driving dataset covering a 728 km train journey from Trondheim to Bodø in Nordland, Norway, recorded four times, once per season. -
People’s Speech
The People’s Speech: A large-scale diverse English speech recognition dataset for commercial usage. -
DataComp-1B
DataComp-1B is a large-scale dataset for training next-generation image-text models. -
Webvid-10M
Webvid-10M is a large-scale dataset of short videos with textual descriptions, used to train the video model. -
LAION COCO 600M
The dataset used for training the text-to-video model consists of 20 million videos and 600 million images. -
VoxCeleb
VoxCeleb is a large-scale speaker identification dataset. -
LAION-Aesthetic
LAION-Aesthetic is a large-scale image dataset. -
Kinetics dataset
The Kinetics dataset is a large-scale action recognition dataset. It contains videos of human actions, each annotated with the action performed. -
UCF-101 dataset
The UCF-101 dataset is a large-scale action recognition dataset containing 13,320 videos in 101 human action categories.