Scaling laws and fluctuations in the statistics of word frequencies

The dataset consists of three large databases: Google-ngram, English Wikipedia, and a collection of scientific articles.

Cite this as

Martin Gerlach, Eduardo G. Altmann (2025). Dataset: Scaling laws and fluctuations in the statistics of word frequencies. https://doi.org/10.57702/dz5po0nq

DOI retrieved: January 2, 2025

Additional Info

Created: January 2, 2025
Last update: January 2, 2025
Defined In: https://doi.org/10.1088/1367-2630/16/11/113010
Author: Martin Gerlach
More Authors: Eduardo G. Altmann
Homepage: https://arxiv.org/abs/1306.0321