Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021;6(1):48.
doi: 10.1007/s41109-021-00389-0. Epub 2021 Jun 30.

Large-scale analysis of delayed recognition using sleeping beauty and the prince

Affiliations

Large-scale analysis of delayed recognition using sleeping beauty and the prince

Takahiro Miura et al. Appl Netw Sci. 2021.

Abstract

Delayed recognition in which innovative discoveries are re-evaluated after a long period has significant implications for scientific progress. The quantitative method to detect delayed recognition is described as the pair of Sleeping Beauty (SB) and its Prince (PR), where SB refers to citation bursts and its PR triggers SB's awakeness calculated based on their citation history. This research provides the methods to extract valid and large SB-PR pairs from a comprehensive Scopus dataset and analyses how PR discovers SB. We prove that the proposed method can extract long-sleep and large-scale SB and its PR best covers the previous multi-disciplinary pairs, which enables to observe delayed recognition. Besides, we show that the high-impact SB-PR pairs extracted by the proposed method are more likely to be located in the same field. This indicates that a hidden SB that your research can awaken may exist closer than you think. On the other hand, although SB-PR pairs are fat-tailed in Beauty Coefficient and more likely to integrate separate fields compared to ordinary citations, it is not possible to predict which citation leads to awake SB using the rarity of citation. There is no easy way to limit the areas where SB-PR pairs occur or detect it early, suggesting that researchers and administrators need to focus on a variety of areas. This research provides comprehensive knowledge about the development of scientific findings that will be evaluated over time.

Keywords: Bibliometrics; Citation network; Delayed recognition; Interdisciplinary fusion; Prince; Production of knowledge; Sleeping beauty.

PubMed Disclaimer

Conflict of interest statement

Competing interestsThe authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
Overview of how a PR discovers an SB in the citation network. Each node represents papers and edges shows their citation. (1, left) As soon as an SB is submitted, it is indistinguishable from ordinary papers in the community without being cited much. (2, middle) When the PR realised the importance of the SB and cited it, it spread to the rest of the scientific community and led to the paper being co-cited. (3, right) A sub-field created by a pair of papers becomes widely recognised, and a new field can emerge from it
Fig. 2
Fig. 2
Publication year of papers. The black bar shows the whole number of papers published in the year. The orange bar represents the top5% in citation normalised by the average number of citations of papers submitted in the same field in the same year (Radicchi et al. 2008). The small graph shows the proportion of top papers to the total number of papers submitted in each year, which was stable at around 0.05 for papers in the period when there was sufficient time to obtain citations
Fig. 3
Fig. 3
Illustration of the definition of the beauty coefficient B and its awakening year ta. The black curve represents the number of citations ct received by the paper at age t. The awakening time tatm is defined as the age that maximises the distance from the line connceting points (0,c0) and (tm,ctm). B is calculated as a summation of the ratio of blue lines against each blue dot line from 0 to tm
Fig. 4
Fig. 4
Distribution of B for top papers. A red dot line indicates the least score of SBs behind the peak of the majority of papers
Fig. 5
Fig. 5
Distribution of citation in each year for all papers. Each coloured line refers to the average number of citation published in dotted point. Although citation inflation increases the citations to papers submitted in recent years, it does not increase the number of citations to older papers
Fig. 6
Fig. 6
Yearly citation curve of the lowest B-score in SB papers. The dotted line indicates the year ta when the paper was discovered
Fig. 7
Fig. 7
Sensitivity of clustering in 2020. Every sub-clustering, the maximum size of cluster is reduced by about 1/10, and the number of clusters is increased nearly tenfold
Fig. 8
Fig. 8
Cluster size in 2020. The cluster size increases on a log scale in the use of modularity maximisation
Fig. 9
Fig. 9
Publication year of SB and PR extracted in 2020. The orange bars show the distribution of submission years for SB, and the blue bars show the distribution of submission years for PR
Fig. 10
Fig. 10
Probability of gap year between SB and PR. Length of sleep duration for SB and PR pairs in the dataset divided into 5-year increments. The mean values are 16.0, 19.3, 22.7, 25.9, and 27.9 years, which is about half of the total length of the dataset
Fig. 11
Fig. 11
Illustration of the null model. Each node (paper) has a potential of ko (the number of black dot), which is defined by its 2020 citation, and every after adding new nodes every year, new edges are created with equal probability based on the remaining black dots
Fig. 12
Fig. 12
Distribution of B in the dataset and null model. The thin line is the result of log-normal fitting using the maximum likelihood method on each data
Fig. 13
Fig. 13
Density distribution of d for inter-disciplinary and intra-disciplinary at the time of their citation
Fig. 14
Fig. 14
Density distribution of d for random pairs and SB–PR pairs

References

    1. Asatani K, Mori J, Ochi M, Sakata I. Detecting trends in academic research from a citation network using network representation learning. PLoS ONE. 2018;13(5):0197260. doi: 10.1371/journal.pone.0197260. - DOI - PMC - PubMed
    1. Asatani K, Takeda H, Yamano H, Sakata I. Scientific attention to sustainability and SDGS: meta-analysis of academic papers. Energies. 2020;13(4):975. doi: 10.3390/en13040975. - DOI
    1. Baas J, Schotten M, Plume A, Côté G, Karimi R. Scopus as a curated, high-quality bibliometric data source for academic research in quantitative science studies. Quant Sci Stud. 2020;1(1):377–386. doi: 10.1162/qss_a_00019. - DOI
    1. Blondel VD, Guillaume J-L, Lambiotte R, Lefebvre E. Fast unfolding of communities in large networks. J Stat Mech: Theory Exp. 2008;2008(10):10008. doi: 10.1088/1742-5468/2008/10/P10008. - DOI
    1. Bornmann L, Ye AY, Ye FY. Identifying “hot papers” and papers with “delayed recognition” in large-scale datasets by using dynamically normalized citation impact scores. Scientometrics. 2018;116(2):655–674. doi: 10.1007/s11192-018-2772-0. - DOI - PMC - PubMed

LinkOut - more resources