Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Mar 1;11(7):nwae073.
doi: 10.1093/nsr/nwae073. eCollection 2024 Jul.

Beyond network centrality: individual-level behavioral traits for predicting information superspreaders in social media

Affiliations

Beyond network centrality: individual-level behavioral traits for predicting information superspreaders in social media

Fang Zhou et al. Natl Sci Rev. .

Abstract

Understanding the heterogeneous role of individuals in large-scale information spreading is essential to manage online behavior as well as its potential offline consequences. To this end, most existing studies from diverse research domains focus on the disproportionate role played by highly connected 'hub' individuals. However, we demonstrate here that information superspreaders in online social media are best understood and predicted by simultaneously considering two individual-level behavioral traits: influence and susceptibility. Specifically, we derive a nonlinear network-based algorithm to quantify individuals' influence and susceptibility from multiple spreading event data. By applying the algorithm to large-scale data from Twitter and Weibo, we demonstrate that individuals' estimated influence and susceptibility scores enable predictions of future superspreaders above and beyond network centrality, and reveal new insights into the network positions of the superspreaders.

Keywords: complex networks; information spreading; social media; social networks; superspreaders.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Quantifying individual influence and susceptibility from observational data. To illustrate the intuition behind the proposed influence-susceptibility algorithm, in panel (a), we show how two nodes (A, B) with the same number of outgoing links and the same number of propagation events can achieve a widely different influence score I. The size of an orange (blue) node denotes its susceptibility score S (influence score I); the thickness of an arrow represents the number of propagation events. Both A and B have three outgoing links, meaning that they influence three nodes. However, A’s neighbors have a lower susceptibility than node B’s neighbors, because they are influenced fewer times by their other neighbors (the blue nodes in the outer-most shell). As A influences less susceptible individuals than B does, according to the IS algorithm, A is more influential than B. With a similar argument, in panel (b), node C is more susceptible than D.
Figure 2.
Figure 2.
Empirical correlations between individual-level properties. We divide the spreading events into six consecutive non-overlapping periods. The date on the x axis represents the starting date of each period. In each period, we reconstruct individual influence and susceptibility via the IS algorithm, and measure the Spearman’s correlation coefficients, ρ, between the indegree (kin), outdegree (kout), influence (formula image) and susceptibility (formula image). Only those value pairs where both values are not equal to 0 are conserved. Filled markers denote correlation values that are significantly larger or smaller than the correlation values calculated on randomized networks (P < 0.05; see Note S4 for details); open markers denote correlation values that do not significantly differ from the correlation values calculated on randomized networks (P > 0.05). The strongest consistent empirical correlation is that between the outdegree (kout) and influence (formula image); the other correlations are weak or non-significant.
Figure 3.
Figure 3.
Empirical assortativity properties between degree, influence and susceptibility. We divide the spreading events into six consecutive non-overlapping periods. The date on the x axis represents the starting date of each period. In each period, we reconstruct individual influence and susceptibility via the IS algorithm, and we measure Spearman’s correlation coefficients, ρ, between an individual’s properties and her neighbors’ properties. The only consistent correlations are the positive one between an individual’s susceptibility (formula image) and the influence of the individuals she retweets (formula image), and the negative one between an individual’s outdegree (kout) and the susceptibility of the individuals who retweet from her (formula image).
Figure 4.
Figure 4.
Predicting superspreaders. We rely on the random forest classification algorithm and use different features as input to predict superspreaders, where the superspreaders are defined as individuals with top formula image spreading capacity. Panels (a), (b), (d) and (e) show the superspreaders predicting performance on Weibo COVID and Twitter COVID. Two metrics, AUPRC and precision, are adopted to measure the performance of models. Panels (c) and (f) show the feature importance resulting from training the combined model. Across all windows, the best-performing model is either the combined model or the behavior-based (IS) model, which points to the essential role of the IS scores for the superspreader prediction. Panels (c) and (f) show the feature importance obtained from training the combined model. IS-based features tend to be more important than centrality-based features.

Similar articles

Cited by

References

    1. Bak-Coleman JB, Alfano M, Barfuss W et al. Stewardship of global collective behavior. Proc Natl Acad Sci USA 2021; 118: e2025764118.10.1073/pnas.2025764118 - DOI - PMC - PubMed
    1. Lacassin R, Devès M, Hicks SP et al. Rapid collaborative knowledge building via twitter after significant geohazard events. Geosci Lett 2020; 3: 129–46.10.5194/gc-3-129-2020 - DOI
    1. Chen Y, Harper FM, Konstan J et al. Social comparisons and contributions to online communities: a field experiment on movielens. Am Econ Rev 2010; 100: 1358–98.10.1257/aer.100.4.1358 - DOI
    1. Centola D. How Behavior Spreads. Princeton: Princeton University Press, 2018.
    1. Huber B, Barnidge M, Gil de Zúñiga H et al. Fostering public trust in science: the role of social media. Public Underst Sci 2019; 28: 759–77.10.1177/0963662519869097 - DOI - PubMed