Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2004 Oct 5;101(40):14333-7.
doi: 10.1073/pnas.0405728101. Epub 2004 Sep 24.

Entropy of dialogues creates coherent structures in e-mail traffic

Affiliations

Entropy of dialogues creates coherent structures in e-mail traffic

Jean-Pierre Eckmann et al. Proc Natl Acad Sci U S A. .

Abstract

We study the dynamic network of e-mail traffic and find that it develops self-organized coherent structures similar to those appearing in many nonlinear dynamic systems. Such structures are uncovered by a general information theoretic approach to dynamic networks based on the analysis of synchronization among trios of users. In the e-mail network, coherent structures arise from temporal correlations when users act in a synchronized manner. These temporally linked structures turn out to be functional, goal-oriented aggregates that must react in real time to changing objectives and challenges (e.g., committees at a university). In contrast, static structures turn out to be related to organizational units (e.g., departments).

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Spike trains for the three communication channels determining the edges of a triangle formed by users A, B, and C. We have Ip(A,B) = 0.095, Ip(A,C) = 0.394, Ip(B,C) = 0.172, and IT = 1.606. It is important to note that Ip and IT capture the synchronization of the e-mail exchange at two different levels. IT measures the coherence of the triangle as a whole and can take on high values, even though some of the Ip values are relatively small. The horizontal line at the bottom represents the whole period under analysis divided into days (and weekends) and was introduced to aid the visualization of the events determining the probabilities entering Ip and IT (see Eqs. 1 and 2).
Fig. 2.
Fig. 2.
Probability distribution of the response time until a message is answered (as described in The Model). (Inset) The same probability distribution is measured in “ticks,” i.e., units of messages sent in the system. Binning is logarithmic. Solid lines approximately follow Δt-1 and are meant as guides for the eye.
Fig. 3.
Fig. 3.
Graph of static and temporal statistical quantities. Probability distribution of the number of triangles T in which a user participates. Circles indicate static triangles, whereas crosses indicate temporally coherent triangles (i.e., mutual information IT ≥ 0.1). Both lines are well fitby NT-1.2, and the black line is provided as a guide for the eye with this slope.
Fig. 4.
Fig. 4.
The static structure of the graph of e-mail traffic obtained from our data, arranged according to curvature, based on triangles of mutual recognition. Time is thus not taken into account, and the graph of users arranges itself primarily according to departments, shown in various colors.
Fig. 5.
Fig. 5.
The conjugate graph, for a cutoff of 0.5. Each node is a triangle of three people conferring with temporal coherence IT ≥ 0.5 (rather than a single user, as in Fig. 4), and each link connects two adjacent such triangles. The three colors of each node indicate the departments of the three people composing that node (we used the same color code as in Fig. 4). Note the strong clustering of the graph into very compact groups of people. The users cross departmental boundaries (their interests and connections are not shown, in consideration of privacy).
Fig. 6.
Fig. 6.
Cumulative probability graph for IT. The solid lines are fits to two different exponentials, and they cross at the critical point of IT ≈ 0.65. Axes are semilogarithmic. α is the threshold value.
Fig. 7.
Fig. 7.
Comparison of dynamic (red) and static (blue) groups. (Upper) Time sequence for the 83 days on which data were taken, showing two groups of comparable size (dynamic group, 6 participants and a total of 505 events over the whole period; static group, 10 participants and a total of 389 events), with the sum of the activity normalized to 1. (Lower Left) Graph of events for the two groups shown in Upper, after removing weekends. (Lower Right) Averaged graph for all dynamic and all static groups after rescaling (on the x axis) by the maximal number of events and normalizing the distribution (on the y axis) to unity. norm. units, Normalized units; arb. units, arbitrary units.

Similar articles

Cited by

References

    1. Eckmann, J.-P. & Moses, E. (2002) Proc. Natl. Acad. Sci. USA 99, 5825-5829. - PMC - PubMed
    1. Milo, R., Shen-Orr, S., Itzkovitz, S., Kashtan, N., Chklovskii, D. & Alon, U. (2002) Science 298, 824-827. - PubMed
    1. Albert, R. & Barabási, A.-L. (1999) Rev. Mod. Phys. 74, 47-97.
    1. Kleinberg, J. M. & Lawrence, S. (2001) Science 294, 1849-1850. - PubMed
    1. Cover, T. M. & Thomas, J. A. (1991) Elements of Information Theory (Wiley, New York).

LinkOut - more resources