Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2004 Apr 6;101 Suppl 1(Suppl 1):5266-73.
doi: 10.1073/pnas.0307625100. Epub 2004 Feb 19.

The simultaneous evolution of author and paper networks

Affiliations

The simultaneous evolution of author and paper networks

Katy Börner et al. Proc Natl Acad Sci U S A. .

Abstract

There has been a long history of research into the structure and evolution of mankind's scientific endeavor. However, recent progress in applying the tools of science to understand science itself has been unprecedented because only recently has there been access to high-volume and high-quality data sets of scientific output (e.g., publications, patents, grants) and computers and algorithms capable of handling this enormous stream of data. This article reviews major work on models that aim to capture and recreate the structure and dynamics of scientific evolution. We then introduce a general process model that simultaneously grows coauthor and paper citation networks. The statistical and dynamic properties of the networks generated by this model are validated against a 20-year data set of articles published in PNAS. Systematic deviations from a power law distribution of citations to papers are well fit by a model that incorporates a partitioning of authors and papers into topics, a bias for authors to cite recent papers, and a tendency for authors to cite papers cited by papers that they have read. In this TARL model (for topics, aging, and recursive linking), the number of topics is linearly related to the clustering coefficient of the simulated paper citation network.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Process model in pseudo code. If no topics are considered then the number of topics is one, i.e., all papers and authors have the same topic. If no coauthors are considered then each paper has exactly one author. If the reference path length is 0 then no references are considered for citation. If no aging function is given then all papers have the same probability of getting selected.
Fig. 2.
Fig. 2.
Author-paper network generated by using the model with topics only (a) and coauthors only (b). The model was started with five topics, authors, and papers and run for 2 years. In each year, each author produces one paper, which cites two earlier papers. No authors were added or deactivated. The resulting networks has five authors (labeled a1-a5, blue circles) and 15 or 9 papers (labeled 0, 2, 3..., red triangles). Papers are linked via red directed provided input to links. Authors are connected by blue coauthorships links. Light green indicates directed links denoting the flow of information from papers to authors and from authors to new papers via consumed and produced relations.
Fig. 3.
Fig. 3.
Paper network without aging (a) and with aging (b). Without aging, older papers are more likely to provide input to younger papers; i.e., they are attracting most of the citation links. For example, in a, paper no. 0 generated at initialization time and paper no. 5 generated in year 1 provide input to four and six papers, respectively.
Fig. 4.
Fig. 4.
Coverage of the PNAS data set in terms of time span, total papers, and complete authors' work.
Fig. 5.
Fig. 5.
Total number of actual and simulated papers (#p) and authors (#a) (a) and received citations (#c_win) (b).
Fig. 6.
Fig. 6.
Cluster coefficient as a function of the aging function (a) and the reference path length (b).

References

    1. White, H. D. & McCain, K. W. (1989) in Annual Review on Information Science and Technology, ed. Williams, M. E. (Elsevier, Amsterdam), Vol. 24, pp. 119-186.
    1. van Raan, A. F. J. (1997) Scientometrics 38, 205-218.
    1. Börner, K., Chen, C. & Boyack, K. (2003) in Annual Review of Information Science and Technology, ed. Cronin, B. (Information Today/Am. Soc. Information Science and Technology, Medford, NJ), Vol. 37, pp. 179-255.
    1. Albert, R. & Barabási, A.-L. (2002) Rev. Mod. Phys. 74, 47-97.
    1. Watts, D. J. (1999) Small Worlds: The Dynamics of Networks Between Order and Randomness (Princeton Univ. Press, Princeton, NJ).

Publication types

LinkOut - more resources