Sci Rep. 2025 Dec 8;15(1):43634.
doi: 10.1038/s41598-025-27487-8.

Leveraging complex network features improves vaccine stance classification

Durazzi Francesco et al. Sci Rep. 2025.

Abstract

The widespread use of social media allows unprecedented ways to monitor opinions and stances regarding critical public health issues globally. Advanced natural language processing algorithms are routinely used to extract information and classify vaccine hesitancy or stance. However, communication on online social networks such as Twitter (now X) is carried by short messages, whose meaning can be difficult to understand in the absence of context. Therefore, in this study we propose the use of complex-network features extracted from the social network to integrate and enhance text-based deep learning models. Leveraging a dataset of about 20 million Italian-language posts (of which about 7000 were manually annotated), we showed that the integration of text and network features improves vaccine stance classification, especially for the most polarized classes. Additionally, network features outperformed text features on a dataset collected a year after model training, possibly indicating that the social network changes more slowly than trending words or topics.
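The abstract does not say how the text and network features are combined. One simple possibility is feature concatenation (early fusion), sketched below in Python. The function names, the three-dimensional text embedding, the two layout coordinates, and the one-hot community encoding are all illustrative assumptions, not the authors' pipeline.

```python
from typing import List, Sequence


def one_hot(index: int, size: int) -> List[float]:
    """Encode a community index as a one-hot vector of length `size`."""
    if not 0 <= index < size:
        raise ValueError("community index out of range")
    return [1.0 if i == index else 0.0 for i in range(size)]


def fuse_features(
    text_embedding: Sequence[float],
    layout_xy: Sequence[float],
    community: int,
    n_communities: int,
) -> List[float]:
    """Concatenate a post's text embedding with its author's network features.

    Hypothetical early-fusion scheme: [text embedding | layout x, y | community one-hot].
    """
    return list(text_embedding) + list(layout_xy) + one_hot(community, n_communities)


# Made-up values for illustration only.
post_embedding = [0.12, -0.40, 0.88]  # would come from a text model
author_xy = (1.5, -2.0)               # author's position in a 2D network layout
author_community = 2                  # author's community index (0-based)

fused = fuse_features(post_embedding, author_xy, author_community, n_communities=7)
print(len(fused))  # 3 text + 2 layout + 7 community dimensions
```

The fused vector could then be passed to any standard classifier. Other designs, such as training separate text and network models and combining their predictions, would fit the abstract's description equally well.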

Conflict of interest statement

Declarations. Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Community distribution using the Leiden algorithm with coverage fixed at 90%. The first six communities cover 90% of the first dataset, while the remaining 10% of users are placed in a seventh community.
Fig. 2
Network layout using the fa2 algorithm. Different colors denote the distinct Leiden communities represented in the visualization. (a) First dataset, (b) second dataset. N: number of users.
Fig. 3
F1 scores per class in (a) the first test set and (b) the second dataset, for the integrated classifier (fa2 + text) and for classifiers based on fa2 and text separately.
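The 90% coverage rule in the Fig. 1 caption can be read as: keep the largest communities until they contain at least 90% of users, then pool everyone else into one extra community. The Python sketch below illustrates that reading. The function name, the input format (a user-to-community mapping, as a community detection step might produce), and the toy sizes are assumptions; the record does not describe how the authors implemented the cut.

```python
from collections import Counter
from typing import Dict, Hashable


def group_by_coverage(
    membership: Dict[Hashable, int], coverage: float = 0.9
) -> Dict[Hashable, int]:
    """Relabel communities so the largest ones reach `coverage` of all users.

    Kept communities are renumbered 0, 1, 2, ... by decreasing size.
    All remaining users share one extra label (the next integer).
    """
    sizes = Counter(membership.values())
    total = len(membership)
    kept: Dict[int, int] = {}
    covered = 0
    for old_label, size in sizes.most_common():
        if covered >= coverage * total:
            break  # target coverage reached; the rest is pooled
        kept[old_label] = len(kept)
        covered += size
    other = len(kept)  # label of the pooled "remainder" community
    return {user: kept.get(label, other) for user, label in membership.items()}


# Toy example: 20 hypothetical users in communities of size 12, 5, 2 and 1.
membership: Dict[str, int] = {}
for label, size in [(7, 12), (3, 5), (9, 2), (4, 1)]:
    for _ in range(size):
        membership[f"u{len(membership)}"] = label

grouped = group_by_coverage(membership, coverage=0.9)
print(sorted(Counter(grouped.values()).items()))
```

In this toy case the three largest communities hold 19 of 20 users, which exceeds the 90% target, so the single remaining user lands in the pooled community with label 3.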
