Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Dec 17;110(51):20380-5.
doi: 10.1073/pnas.1320578110. Epub 2013 Dec 2.

A maximum entropy framework for nonexponential distributions

Affiliations

A maximum entropy framework for nonexponential distributions

Jack Peterson et al. Proc Natl Acad Sci U S A. .

Abstract

Probability distributions having power-law tails are observed in a broad range of social, economic, and biological systems. We describe here a potentially useful common framework. We derive distribution functions for situations in which a "joiner particle" k pays some form of price to enter a community of size , where costs are subject to economies of scale. Maximizing the Boltzmann-Gibbs-Shannon entropy subject to this energy-like constraint predicts a distribution having a power-law tail; it reduces to the Boltzmann distribution in the absence of economies of scale. We show that the predicted function gives excellent fits to 13 different distribution functions, ranging from friendship links in social networks, to protein-protein interactions, to the severity of terrorist attacks. This approach may give useful insights into when to expect power-law distributions in the natural and social sciences.

Keywords: fat tail; heavy tail; social physics; statistical mechanics; thermostatistics.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Fig. 1.
Fig. 1.
The joining cost for a particle to join a size formula image community is formula image. This diagram can describe particles forming colloidal clusters, or social processes such as people joining cities, citations added to papers, or link creation in a social network.
Fig. 2.
Fig. 2.
Eq. 7 gives good fits (formula image; see SI Text for details) to 13 empirical distributions, with the values of formula image and formula image given in Table 1. Points are empirical data, and lines represent best-fit distributions. The probability formula image of exactly k is shown in blue, and the probability of at least k (the complementary cumulative distribution, formula image) is shown in red. Descriptions and references for these datasets can be found in SI Text.
Fig. 3.
Fig. 3.
Eq. 7 fitted to the 13 datasets in Table 1, plotted against the total cost to assemble a size k community, formula image. Values of formula image and formula image are shown in Table 1. The y axis has been rescaled by dividing by the maximum formula image, so that all curves begin at formula image. All data sets are fit by the formula image line. See Fig. 2 for fits to individual datasets.

References

    1. Mantegna R, Stanley H. Scaling behaviour in the dynamics of an economic index. Nature. 1995;376:46–49.
    1. Zipf GK. Human Behavior and the Principle of Least Effort. Cambridge, MA: Addison-Wesley; 1949.
    1. Broder A, et al. Graph structure in the web. Comput Netw. 2000;33(1-6):309–320.
    1. Newman M. Power laws, Pareto distributions and Zipf’s law. Contemp Phys. 2005;46(5):323–351.
    1. Clauset A, Shalizi C, Newman M. Power-law distributions in empirical data. SIAM Rev. 2009;51(4):661–703.

Publication types