Taylor's Law in Innovation Processes

Francesca Tria¹, Irene Crimaldi², Giacomo Aletti³, Vito D P Servedio⁴

Affiliations

¹ Physics Department, Sapienza University of Rome, P.le Aldo Moro 5, 00185 Rome, Italy.
² IMT School for Advanced Studies Lucca, Piazza San Ponziano 6, 55100 Lucca, Italy.
³ ADAMSS Center, Università Degli Studi di Milano, 20133 Milan, Italy.
⁴ Complexity Science Hub Vienna, Josefstädter Strasse 39, A-1080 Vienna, Austria.

PMID: 33286342
PMCID: PMC7517092
DOI: 10.3390/e22050573

Taylor's Law in Innovation Processes

Francesca Tria et al. Entropy (Basel). 2020.

. 2020 May 19;22(5):573.

doi: 10.3390/e22050573.

Authors

Francesca Tria¹, Irene Crimaldi², Giacomo Aletti³, Vito D P Servedio⁴

Affiliations

¹ Physics Department, Sapienza University of Rome, P.le Aldo Moro 5, 00185 Rome, Italy.
² IMT School for Advanced Studies Lucca, Piazza San Ponziano 6, 55100 Lucca, Italy.
³ ADAMSS Center, Università Degli Studi di Milano, 20133 Milan, Italy.
⁴ Complexity Science Hub Vienna, Josefstädter Strasse 39, A-1080 Vienna, Austria.

PMID: 33286342
PMCID: PMC7517092
DOI: 10.3390/e22050573

Abstract

Taylor's law quantifies the scaling properties of the fluctuations of the number of innovations occurring in open systems. Urn-based modeling schemes have already proven to be effective in modeling this complex behaviour. Here, we present analytical estimations of Taylor's law exponents in such models, by leveraging on their representation in terms of triangular urn models. We also highlight the correspondence of these models with Poisson-Dirichlet processes and demonstrate how a non-trivial Taylor's law exponent is a kind of universal feature in systems related to human activities. We base this result on the analysis of four collections of data generated by human activity: (i) written language (from a Gutenberg corpus); (ii) an online music website (Last.fm); (iii) Twitter hashtags; (iv) an online collaborative tagging system (Del.icio.us). While Taylor's law observed in the last two datasets agrees with the plain model predictions, we need to introduce a generalization to fully characterize the behaviour of the first two datasets, where temporal correlations are possibly more relevant. We suggest that Taylor's law is a fundamental complement to Zipf's and Heaps' laws in unveiling the complex dynamical processes underlying the evolution of systems featuring innovation.

Keywords: Poisson–Dirichlet process; Pólya’s urn; Taylor’s law; adjacent possible; innovation dynamics; triangular urn schemes.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

**Figure 1**
**Taylor’s law in the urn model with triggering.***Left:* Taylor’s law from 100 realizations of the stochastic process described in Section 2 (the urn model with triggering), for each of the indicated values parameters. The values of parameters are chosen in order to have a representative curve for each of the analyzed regimes, i.e., $ρ < ν / 2$ , $ρ = ν / 2$ , $ν / 2 < ρ < ν$ , $ρ = ν$ , $ρ > ν$ . Each realization is a sequence of $10^{6}$ elements. *Right:* Taylor’s law from the same sequences as in the left side picture, individually reshuffled so that to loose the temporal order (refer to the *parallel file random* reshuffling procedure discussed in Section 5 and in Figure 2).

**Figure 2**
**Shuffling procedures.** In this example we consider three different streams A, B, C, consisting of five tokens each. When the analysis is carried in *parallel* the streams are aligned respecting their natural order (left panel). In the *parallel file random* case (middle panel), each stream is reshuffled singularly. Eventually, the parallel random case shuffles the elements all together (right panel).

**Figure 3**
**Taylor’s law in real systems and in their randomized instances.** The standard deviation $σ (N)$ of the number of different tokens after N total tokens appeared, is plotted vs the average number of different tokens $μ (N)$ in four different datasets. The shuffled counterparts are also evaluated. The shuffling schemes are shown in Figure 2.

**Figure 4**
**Stability of Taylor’s law results in the Gutenberg corpus.***Left:* the analogous of Figure 3 (top left) for three different sets of $M = 100$ books from the Gutenberg corpus. *Right*: as in Figure 3 (top left), with 20 different realizations of the *parallel file random* reshuffling procedure. We see that the difference between the curve referred to the ordered sequences and those referred to the reshuffled ones is much higher than fluctuations due to different realizations of the reshuffling.

**Figure 5**
**Taylor’s law in the urn model with triggering and quenched stochasticity of the parameters and in the urn model with semantic triggering.***Top:* Taylor’s law in the urn model with triggering, with parameter’s $N_{0} = 100$ , $ρ = 1$ and $ν$ is randomly extracted for each simulation of the process from a uniform distribution on the interval $(0, 1)$ (left) and from an exponential distribution on the interval $(0, 1)$ and parameter $λ = 1$ (right), as discussed in the main text. *Center:* Taylor’s law in the urn model with triggering, with parameter’s respectively: (left) $N_{0} = 100$ , $ν = 2$ , $ρ = 3 + r_{i}$ , with $r_{i}$ randomly extracted for each simulation of the process from an exponential distribution with mean $\bar{r_{i}} = 1$ ; $ν = 2$ , $ρ = 3$ , $N_{0} = 1 + n_{i}$ , with $n_{i}$ randomly extracted for each simulation of the process from an exponential distribution with mean $\bar{n_{i}} = 10^{4}$ . *Bottom:* (left) Taylor’s law in the urn model with semantic triggering, with parameters $N_{0} = 100$ , $ν = 6$ , $ρ = 9$ , $η = 0.6$ ; (right) Taylor’s law in the urn model with semantic triggering, with parameters $ν = 2$ , $ρ = 3$ , $η = 0.6$ , $N_{0} = 1 + n_{i}$ , with $n_{i}$ randomly extracted for each simulation of the process from an exponential distribution with mean $\bar{n_{i}} = 10^{4}$ . The parameters of the simulations were chosen such to lie in the regime $ρ < ν$ . The parameter $η = 0.6$ used in the bottom graphs was chosen in the regime where the Heaps’ and Zipfs’ laws feature exponents compatible with those observed in real systems. In all the figures the curves for the Taylor’s law are constructed from 100 independent realizations of the process (M = 100 in Equation (16)).

See this image and copyright information in PMC

References

1. Zipf G.K. Relative Frequency as a Determinant of Phonetic Change. Harv. Stud. Class. Philol. 1929;40:1–95. doi: 10.2307/310585. - DOI
1. Zipf G.K. The Psychobiology of Language. Houghton-Mifflin; New York, NY, USA: 1935.
1. Zipf G.K. Human Behavior and the Principle of Least Effort. Addison-Wesley; Reading MA, USA: 1949.
1. Herdan G. Type-Token Mathematics: A Textbook of Mathematical Linguistics. Mouton en Company; Berlin, Germany: 1960. (Janua linguarum. series maior. no. 4).
1. Heaps H.S. Information Retrieval-Computational and Theoretical Aspects. Academic Press; Cambridge, MA, USA: 1978.

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Taylor's Law in Innovation Processes

Affiliations

Taylor's Law in Innovation Processes

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

LinkOut - more resources

Full Text Sources