German Common Crawl

German Common Crawl is a dataset of web pages crawled from the internet.

Data and Resources

Cite this as

Laippala et al. (2025). Dataset: German Common Crawl. https://doi.org/10.57702/u8qduzr9

DOI retrieved: January 2, 2025

Additional Info

Field Value
Created January 2, 2025
Last update January 2, 2025
Defined In https://doi.org/10.48550/arXiv.2403.08763
Author Laippala et al.
Homepage https://oscar.readthedocs.io/en/latest/datasets.html