Figure 3 | Scientific Reports

Figure 3

From: Zipf’s law holds for phrases, not words

Random partitioning distributions for the four large corpora:

(A) Wikipedia (2010); (B) The New York Times (1987–2007); (C) Twitter (2009); and (D) Music Lyrics (1960–2007). The top-right insets show the long tails of the random partitioning distributions; colors represent phrase length, as indicated by the color bar. The gray curves are standard Zipf distributions for words (q = 1); they exhibit limited scaling, with clear scaling breaks. See the main text and Tabs. S1–S4 for example phrases.
