A Catalogue of Machine Learning Algorithms for Healthcare Risk Predictions

doi:10.3390/s22228615

. 2022 Nov 8;22(22):8615.

doi: 10.3390/s22228615.

A Catalogue of Machine Learning Algorithms for Healthcare Risk Predictions

Argyro Mavrogiorgou¹, Athanasios Kiourtis¹, Spyridon Kleftakis¹, Konstantinos Mavrogiorgos¹, Nikolaos Zafeiropoulos¹, Dimosthenis Kyriazis¹

Affiliations

PMID: 36433212
PMCID: PMC9695983
DOI: 10.3390/s22228615

A Catalogue of Machine Learning Algorithms for Healthcare Risk Predictions

Argyro Mavrogiorgou et al. Sensors (Basel). 2022.

. 2022 Nov 8;22(22):8615.

doi: 10.3390/s22228615.

Authors

Argyro Mavrogiorgou¹, Athanasios Kiourtis¹, Spyridon Kleftakis¹, Konstantinos Mavrogiorgos¹, Nikolaos Zafeiropoulos¹, Dimosthenis Kyriazis¹

Affiliation

¹ Department of Digital Systems, University of Piraeus, 185 34 Piraeus, Greece.

PMID: 36433212
PMCID: PMC9695983
DOI: 10.3390/s22228615

Abstract

Extracting useful knowledge from proper data analysis is a very challenging task for efficient and timely decision-making. To achieve this, there exist a plethora of machine learning (ML) algorithms, while, especially in healthcare, this complexity increases due to the domain's requirements for analytics-based risk predictions. This manuscript proposes a data analysis mechanism experimented in diverse healthcare scenarios, towards constructing a catalogue of the most efficient ML algorithms to be used depending on the healthcare scenario's requirements and datasets, for efficiently predicting the onset of a disease. To this context, seven (7) different ML algorithms (Naïve Bayes, K-Nearest Neighbors, Decision Tree, Logistic Regression, Random Forest, Neural Networks, Stochastic Gradient Descent) have been executed on top of diverse healthcare scenarios (stroke, COVID-19, diabetes, breast cancer, kidney disease, heart failure). Based on a variety of performance metrics (accuracy, recall, precision, F1-score, specificity, confusion matrix), it has been identified that a sub-set of ML algorithms are more efficient for timely predictions under specific healthcare scenarios, and that is why the envisioned ML catalogue prioritizes the ML algorithms to be used, depending on the scenarios' nature and needed metrics. Further evaluation must be performed considering additional scenarios, involving state-of-the-art techniques (e.g., cloud deployment, federated ML) for improving the mechanism's efficiency.

Keywords: catalogue; data analysis; healthcare; machine learning; prediction; supervised learning.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

**Figure 1**
Indicative example of BNB steps.

**Figure 2**
Indicative example of KNN steps.

**Figure 3**
Indicative example of DT steps.

**Figure 4**
Indicative example of RF steps.

**Figure 5**
Indicative example of LR steps.

**Figure 6**
Indicative example of MLP steps.

**Figure 7**
Indicative example of SGD steps.

**Figure 8**
Overall mechanism architecture.

**Figure 10**
Example of stroke probability form.

**Figure 11**
Precision results of ML models for each use case.

**Figure 12**
Recall results of ML models for each use case.

**Figure 13**
F1-score results of ML models for each use case.

**Figure 14**
Specificity results of ML models for each use case.

**Figure 15**
Train–validation–test score for diabetes use case.

**Figure 16**
Confusion matrix of prediction results for diabetes use case.

**Figure 17**
Performance comparison in the diabetes use case.

**Figure 18**
Train–validation–test score for stroke use case.

**Figure 19**
Confusion matrix of prediction results for stroke use case.

**Figure 20**
Performance comparison in the stroke use case.

**Figure 21**
Train–validation–test score for heart failure use case.

**Figure 22**
Confusion matrix of prediction results for heart failure use case.

**Figure 23**
Performance comparison in the heart failure use case.

**Figure 24**
Train–validation–test score for COVID-19 use case.

**Figure 25**
Confusion matrix of prediction results for COVID-19 use case.

**Figure 26**
Performance comparison in the COVID-19 use case.

**Figure 27**
Train–validation–test score for breast cancer use case.

**Figure 28**
Confusion matrix of prediction results for breast cancer use case.

**Figure 29**
Performance comparison in the breast cancer use case.

**Figure 30**
Train–validation–test score for kidney disease use case.

**Figure 31**
Confusion matrix of prediction results for kidney disease use case.

**Figure 32**
Performance comparison in the kidney disease use case.

**Figure 33**
Training performance comparison for each algorithm per dataset.

See this image and copyright information in PMC

Cited by

Risk Prediction Model for Chronic Kidney Disease in Thailand Using Artificial Intelligence and SHAP.
Tsai MC, Lojanapiwat B, Chang CC, Noppakun K, Khumrin P, Li SH, Lee CY, Lee HC, Khwanngern K. Tsai MC, et al. Diagnostics (Basel). 2023 Nov 28;13(23):3548. doi: 10.3390/diagnostics13233548. Diagnostics (Basel). 2023. PMID: 38066789 Free PMC article.
Unlocking stroke prediction: Harnessing projection-based statistical feature extraction with ML algorithms.
Sahriar S, Akther S, Mauya J, Amin R, Mia MS, Ruhi S, Reza MS. Sahriar S, et al. Heliyon. 2024 Mar 6;10(5):e27411. doi: 10.1016/j.heliyon.2024.e27411. eCollection 2024 Mar 15. Heliyon. 2024. PMID: 38495193 Free PMC article.
Special Issue: "Intelligent Systems for Clinical Care and Remote Patient Monitoring".
Sannino G, Celesti A, De Falco I. Sannino G, et al. Sensors (Basel). 2023 Sep 20;23(18):7993. doi: 10.3390/s23187993. Sensors (Basel). 2023. PMID: 37766047 Free PMC article.
Going paperless - Qualitative monitoring of staff morale during the transition from paper to electronic health records.
Baniulyte G, Rogerson N, Bowden J. Baniulyte G, et al. Heliyon. 2023 Oct 4;9(10):e20645. doi: 10.1016/j.heliyon.2023.e20645. eCollection 2023 Oct. Heliyon. 2023. PMID: 37867851 Free PMC article.
Prediction models for COVID-19 disease outcomes.
Tang CY, Gao C, Prasai K, Li T, Dash S, McElroy JA, Hang J, Wan XF. Tang CY, et al. Emerg Microbes Infect. 2024 Dec;13(1):2361791. doi: 10.1080/22221751.2024.2361791. Epub 2024 Jun 14. Emerg Microbes Infect. 2024. PMID: 38828796 Free PMC article.

See all "Cited by" articles

References

1. Power D.J., Sharda R., Burstein F. Decision Support Systems. John Wiley & Sons, Ltd.; Hoboken, NJ, USA: 2015.
1. Kourou K., Exarchos T.P., Exarchos K.P., Karamouzis M.V., Fotiadis D.I. Machine learning applications in cancer prognosis and prediction. Comput. Struct. Biotechnol. J. 2015;13:8–17. doi: 10.1016/j.csbj.2014.11.005. - DOI - PMC - PubMed
1. Pan L., Liu G., Lin F., Zhong S., Xia H., Sun X., Liang H. Machine learning applications for prediction of relapse in childhood acute lymphoblastic leukemia. Sci. Rep. 2017;7:7402. doi: 10.1038/s41598-017-07408-0. - DOI - PMC - PubMed
1. Zantalis F., Koulouras G., Karabetsos S., Kandris D. A review of machine learning and IoT in smart transportation. Future Internet. 2019;11:94. doi: 10.3390/fi11040094. - DOI
1. Dixon M.F., Halperin I., Bilokon P. Machine Learning in Finance. Volume 1406 Springer; New York, NY, USA: 2020.

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

T2EDK-04207/Operational Program Competitiveness, Entrepreneurship and Innovation

LinkOut - more resources

Full Text Sources
Medical
- MedlinePlus Health Information

[1] Power D.J., Sharda R., Burstein F. Decision Support Systems. John Wiley & Sons, Ltd.; Hoboken, NJ, USA: 2015.

[2] Power D.J., Sharda R., Burstein F. Decision Support Systems. John Wiley & Sons, Ltd.; Hoboken, NJ, USA: 2015.

[3] Kourou K., Exarchos T.P., Exarchos K.P., Karamouzis M.V., Fotiadis D.I. Machine learning applications in cancer prognosis and prediction. Comput. Struct. Biotechnol. J. 2015;13:8–17. doi: 10.1016/j.csbj.2014.11.005. - DOI - PMC - PubMed

[4] Kourou K., Exarchos T.P., Exarchos K.P., Karamouzis M.V., Fotiadis D.I. Machine learning applications in cancer prognosis and prediction. Comput. Struct. Biotechnol. J. 2015;13:8–17. doi: 10.1016/j.csbj.2014.11.005. - DOI - PMC - PubMed

[5] Pan L., Liu G., Lin F., Zhong S., Xia H., Sun X., Liang H. Machine learning applications for prediction of relapse in childhood acute lymphoblastic leukemia. Sci. Rep. 2017;7:7402. doi: 10.1038/s41598-017-07408-0. - DOI - PMC - PubMed

[6] Pan L., Liu G., Lin F., Zhong S., Xia H., Sun X., Liang H. Machine learning applications for prediction of relapse in childhood acute lymphoblastic leukemia. Sci. Rep. 2017;7:7402. doi: 10.1038/s41598-017-07408-0. - DOI - PMC - PubMed

[7] Zantalis F., Koulouras G., Karabetsos S., Kandris D. A review of machine learning and IoT in smart transportation. Future Internet. 2019;11:94. doi: 10.3390/fi11040094. - DOI

[8] Zantalis F., Koulouras G., Karabetsos S., Kandris D. A review of machine learning and IoT in smart transportation. Future Internet. 2019;11:94. doi: 10.3390/fi11040094. - DOI

[9] Dixon M.F., Halperin I., Bilokon P. Machine Learning in Finance. Volume 1406 Springer; New York, NY, USA: 2020.

[10] Dixon M.F., Halperin I., Bilokon P. Machine Learning in Finance. Volume 1406 Springer; New York, NY, USA: 2020.

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

A Catalogue of Machine Learning Algorithms for Healthcare Risk Predictions

Affiliation

A Catalogue of Machine Learning Algorithms for Healthcare Risk Predictions

Authors

Affiliation

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Medical