Zipf's law leads to Heaps' law: analyzing their relation in finite-size systems
- PMID: 21152034
- PMCID: PMC2996287
- DOI: 10.1371/journal.pone.0014139
Zipf's law leads to Heaps' law: analyzing their relation in finite-size systems
Abstract
Background: Zipf's law and Heaps' law are observed in disparate complex systems. Of particular interests, these two laws often appear together. Many theoretical models and analyses are performed to understand their co-occurrence in real systems, but it still lacks a clear picture about their relation.
Methodology/principal findings: We show that the Heaps' law can be considered as a derivative phenomenon if the system obeys the Zipf's law. Furthermore, we refine the known approximate solution of the Heaps' exponent provided the Zipf's exponent. We show that the approximate solution is indeed an asymptotic solution for infinite systems, while in the finite-size system the Heaps' exponent is sensitive to the system size. Extensive empirical analysis on tens of disparate systems demonstrates that our refined results can better capture the relation between the Zipf's and Heaps' exponents.
Conclusions/significance: The present analysis provides a clear picture about the relation between the Zipf's law and Heaps' law without the help of any specific stochastic model, namely the Heaps' law is indeed a derivative phenomenon from the Zipf's law. The presented numerical method gives considerably better estimation of the Heaps' exponent given the Zipf's exponent and the system size. Our analysis provides some insights and implications of real complex systems. For example, one can naturally obtained a better explanation of the accelerated growth of scale-free networks.
Conflict of interest statement
Figures































Similar articles
-
Evolution of scaling emergence in large-scale spatial epidemic spreading.PLoS One. 2011;6(7):e21197. doi: 10.1371/journal.pone.0021197. Epub 2011 Jul 1. PLoS One. 2011. PMID: 21747932 Free PMC article.
-
Deviation of Zipf's and Heaps' Laws in human languages with limited dictionary sizes.Sci Rep. 2013;3:1082. doi: 10.1038/srep01082. Epub 2013 Jan 30. Sci Rep. 2013. PMID: 23378896 Free PMC article.
-
Log-Log Convexity of Type-Token Growth in Zipf's Systems.Phys Rev Lett. 2015 Jun 12;114(23):238701. doi: 10.1103/PhysRevLett.114.238701. Epub 2015 Jun 9. Phys Rev Lett. 2015. PMID: 26196834
-
Zipf's law revisited: Spoken dialog, linguistic units, parameters, and the principle of least effort.Psychon Bull Rev. 2023 Feb;30(1):77-101. doi: 10.3758/s13423-022-02142-9. Epub 2022 Jul 15. Psychon Bull Rev. 2023. PMID: 35840837 Free PMC article. Review.
-
Zipf's word frequency law in natural language: a critical review and future directions.Psychon Bull Rev. 2014 Oct;21(5):1112-30. doi: 10.3758/s13423-014-0585-6. Psychon Bull Rev. 2014. PMID: 24664880 Free PMC article. Review.
Cited by
-
Efficient learning strategy of Chinese characters based on network approach.PLoS One. 2013 Aug 21;8(8):e69745. doi: 10.1371/journal.pone.0069745. eCollection 2013. PLoS One. 2013. PMID: 23990887 Free PMC article.
-
Finite-size effects in transcript sequencing count distribution: its power-law correction necessarily precedes downstream normalization and comparative analysis.Biol Direct. 2018 Feb 12;13(1):2. doi: 10.1186/s13062-018-0204-y. Biol Direct. 2018. PMID: 29433547 Free PMC article.
-
Insights into group-specific pattern of secondary metabolite gene cluster in Burkholderia genus.Front Microbiol. 2024 Jan 16;14:1302236. doi: 10.3389/fmicb.2023.1302236. eCollection 2023. Front Microbiol. 2024. PMID: 38293557 Free PMC article. Review.
-
Waves of novelties in the expansion into the adjacent possible.PLoS One. 2017 Jun 8;12(6):e0179303. doi: 10.1371/journal.pone.0179303. eCollection 2017. PLoS One. 2017. PMID: 28594909 Free PMC article.
-
Evolution of scaling emergence in large-scale spatial epidemic spreading.PLoS One. 2011;6(7):e21197. doi: 10.1371/journal.pone.0021197. Epub 2011 Jul 1. PLoS One. 2011. PMID: 21747932 Free PMC article.
References
-
- Zipf GK. Human Behaviour and the Principle of Least Effort: An introduction to human ecology (Addison-Wesly, Cambridge) 1949.
-
- Heaps HS. Information Retrieval: Computational and Theoretical Aspects (Academic Press, Orlando) 1978.
-
- Clauset A, Shalizi CR, Newman MEJ. Power-law distributions in empirical data. SIAM Rev. 2009;51:661–703.
-
- Axtell RL. Zipf Distribution of U.S. Firm Sizes. Science. 2001;293:1818–1820. - PubMed
-
- Drăgulescu A, Yakovenko VM. Exponential and power-law probability distributions of wealth and income in the United Kingdom and the United States. Physica A. 2001;299:213–221.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources