Intrinsic Dimensions of Language Fractal Structures

The dataset consists of embeddings of all n-grams of a natural language, constituting a representative sample of a language fractal structure.

BibTex: