Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Dec 22;11(1):31.
doi: 10.3390/healthcare11010031.

Adverse Effects of COVID-19 Vaccination: Machine Learning and Statistical Approach to Identify and Classify Incidences of Morbidity and Postvaccination Reactogenicity

Affiliations

Adverse Effects of COVID-19 Vaccination: Machine Learning and Statistical Approach to Identify and Classify Incidences of Morbidity and Postvaccination Reactogenicity

Md Martuza Ahamad et al. Healthcare (Basel). .

Abstract

Good vaccine safety and reliability are essential for successfully countering infectious disease spread. A small but significant number of adverse reactions to COVID-19 vaccines have been reported. Here, we aim to identify possible common factors in such adverse reactions to enable strategies that reduce the incidence of such reactions by using patient data to classify and characterise those at risk. We examined patient medical histories and data documenting postvaccination effects and outcomes. The data analyses were conducted using a range of statistical approaches followed by a series of machine learning classification algorithms. In most cases, a group of similar features was significantly associated with poor patient reactions. These included patient prior illnesses, admission to hospitals and SARS-CoV-2 reinfection. The analyses indicated that patient age, gender, taking other medications, type-2 diabetes, hypertension, allergic history and heart disease are the most significant pre-existing factors associated with the risk of poor outcome. In addition, long duration of hospital treatments, dyspnoea, various kinds of pain, headache, cough, asthenia, and physical disability were the most significant clinical predictors. The machine learning classifiers that are trained with medical history were also able to predict patients with complication-free vaccination and have an accuracy score above 90%. Our study identifies profiles of individuals that may need extra monitoring and care (e.g., vaccination at a location with access to comprehensive clinical support) to reduce negative outcomes through classification approaches.

Keywords: COVID-19; adverse reactions; comorbidities; machine learning; statistical analysis; symptoms; vaccination.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interest.

Figures

Figure 1
Figure 1
The schematic diagram of the overall workflow including data processing, data division, analysis using statistical and machine learning methods, and, at the end, performance evaluation with finding significant features.
Figure 2
Figure 2
Pearson’s correlation heat-map for the dataset medical history.
Figure 3
Figure 3
Pearson’s correlation heat-map for the dataset adverse reactions.
Figure 4
Figure 4
The significant features within the patients’ medical history, where the higher bar length indicates greater the significance.
Figure 5
Figure 5
The significant patients’ adverse reactions after vaccination, where the higher bar length indicates greater the significance.
Figure 6
Figure 6
Area Under the ROC curves for the machine learning model evaluation. (A). classification of died patients’ using patients’ medical history dataset; (B). classification of SARS-CoV-2 positive patients’ using patients’ medical history dataset; (C). classification of hospitalised patients’ using patients’ medical history dataset; (D). classification of died patients’ using patients’ reaction dataset; (E). classification of SARS-CoV-2 positive patients’ using patients’ reaction dataset; (F). classification of hospitalized patients’ using patients’ reaction dataset.
Figure 7
Figure 7
The features ranking according to the coefficient values of the patients’ medical history, calculated after machine learning model training. ML model outcomes indicate that higher coefficient values are mostly close to the significant association of severity.
Figure 8
Figure 8
The features ranking according to the coefficient values of the patients’ adverse reactions, calculated after machine learning model training. ML model outcomes indicate that higher coefficient values are mostly close to the significant association of severity.

References

    1. Wu Z., McGoogan J.M. Characteristics of and important lessons from the coronavirus disease 2019 (COVID19) outbreak in China: Summary of a report of 72314 cases from the Chinese Center for Disease Control and Prevention. JAMA. 2020;323:1239–1242. doi: 10.1001/jama.2020.2648. - DOI - PubMed
    1. Aouissi H.A., Kechebar M.S.A., Ababsa M., Roufayel R., Neji B., Petrisor A.-I., Hamimes A., Epelboin L., Ohmagari N. The importance of behavioral and native factors on covid-19 infection and severity: Insights from a preliminary cross-sectional study. Healthcare. 2022;10:1341. doi: 10.3390/healthcare10071341. - DOI - PMC - PubMed
    1. Ahamad M.M., Aktar S., Rashed-Al-Mahfuz M., Uddin S., Liò P., Xu H., Summers M.A., Quinn J.M.W., Moni M.A. A machine learning model to identify early stage symptoms of SARS-Cov-2 infected patients. Expert Syst. Appl. 2020;160:113661. doi: 10.1016/j.eswa.2020.113661. - DOI - PMC - PubMed
    1. Coronavirus Cases: Worldometer. [(accessed on 7 December 2022)]. Available online: https://www.worldometers.info/coronavirus/
    1. Mutiso F., Pearce J.L., Benjamin-Neelon S.E., Mueller N.T., Li H., Neelon B. Bayesian negative binomial regression with spatially varying dispersion: Modeling COVID-19 incidence in Georgia. Spat. Stat. 2022;52:100703. doi: 10.1016/j.spasta.2022.100703. - DOI - PMC - PubMed

LinkOut - more resources