LAION-Aesthetic-3M

This dataset was used to train the prior model. It contains 2M text-image pairs and 2M audio-visual pairs.

BibTeX: