LAION2B: An open large-scale dataset for training next generation image-text models

LAION2B is a large-scale dataset of image-text pairs collected 'in the wild' from the web and used for training foundation diffusion models.

Cite this as

Christoph Schuhmann, Romain Beaumont, Richard Vencu, Cade Gordon, Ross Wightman, Mehdi Cherti, Theo Coombes, Aarush Katta, Clayton Mullis, Mitchell Wortsman (2024). Dataset: LAION2B: An open large-scale dataset for training next generation image-text models. https://doi.org/10.57702/0bkwmliq

DOI retrieved: December 16, 2024

Additional Info

Field          Value
Created        December 16, 2024
Last update    December 16, 2024
Defined In     https://doi.org/10.48550/arXiv.2306.08687
Author         Christoph Schuhmann
More Authors   Romain Beaumont, Richard Vencu, Cade Gordon, Ross Wightman, Mehdi Cherti, Theo Coombes, Aarush Katta, Clayton Mullis, Mitchell Wortsman
Homepage       https://arxiv.org/abs/2204.14896