Theory and Practice of Integrating Machine Learning and Conventional Statistics in Medical Data Analysis
- PMID: 36292218
- PMCID: PMC9601117
- DOI: 10.3390/diagnostics12102526
Abstract
The practice of medical decision making is changing rapidly with the development of innovative computing technologies. Growing interest in data analysis, together with improvements in big data processing methods, raises the question of whether machine learning can be integrated with conventional statistics in health research. To help address this knowledge gap, this paper presents a review of the conceptual integration between conventional statistics and machine learning, focusing on health research. The similarities and differences between the two are compared using mathematical concepts and algorithms. The comparison indicates that conventional statistics is the fundamental basis of machine learning: the black box algorithms are derived from basic mathematics, but are more advanced in terms of automated analysis, handling big data, and providing interactive visualizations. While the two methods differ in nature, they are conceptually similar. Based on our review, we conclude that conventional statistics and machine learning are best integrated to develop automated data analysis tools. We also strongly believe that health researchers could explore machine learning to enhance conventional statistics in decision making, for additional reliable validation measures.
Keywords: comparison; conventional statistics; data analytics; health research; integration; machine learning.
Conflict of interest statement
The authors declare no conflict of interest.