Dataset - LDM

The WASABI dataset

A large dataset of songs containing lyrics and other metadata about roughly 2M of songs in 21 languages.
- Dataset
- JSON
MS COCO dataset

The MS COCO dataset is a large benchmark for image captioning, containing 328K images with 5 caption descriptions each.
- Dataset
- JSON
YFCC100M

The dataset used in the paper is YFCC100M, a large-scale video dataset. The dataset is used for foreground and background patch extraction and object recognition tasks.
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

3 datasets found