Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2024 Jun 9;11(1):1-21.
doi: 10.1002/cdt3.137. eCollection 2025 Mar.

Pathways to chronic disease detection and prediction: Mapping the potential of machine learning to the pathophysiological processes while navigating ethical challenges

Affiliations
Review

Pathways to chronic disease detection and prediction: Mapping the potential of machine learning to the pathophysiological processes while navigating ethical challenges

Ebenezer Afrifa-Yamoah et al. Chronic Dis Transl Med. .

Abstract

Chronic diseases such as heart disease, cancer, and diabetes are leading drivers of mortality worldwide, underscoring the need for improved efforts around early detection and prediction. The pathophysiology and management of chronic diseases have benefitted from emerging fields in molecular biology like genomics, transcriptomics, proteomics, glycomics, and lipidomics. The complex biomarker and mechanistic data from these "omics" studies present analytical and interpretive challenges, especially for traditional statistical methods. Machine learning (ML) techniques offer considerable promise in unlocking new pathways for data-driven chronic disease risk assessment and prognosis. This review provides a comprehensive overview of state-of-the-art applications of ML algorithms for chronic disease detection and prediction across datasets, including medical imaging, genomics, wearables, and electronic health records. Specifically, we review and synthesize key studies leveraging major ML approaches ranging from traditional techniques such as logistic regression and random forests to modern deep learning neural network architectures. We consolidate existing literature to date around ML for chronic disease prediction to synthesize major trends and trajectories that may inform both future research and clinical translation efforts in this growing field. While highlighting the critical innovations and successes emerging in this space, we identify the key challenges and limitations that remain to be addressed. Finally, we discuss pathways forward toward scalable, equitable, and clinically implementable ML solutions for transforming chronic disease screening and prevention.

Keywords: OMICs data; big data; chronic diseases; disease prediction; machine learning algorithms.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Laboratory methods and application of machine learning for disease prediction and diagnosis. Diagnosis of diseases begins with screening a population, sample collection, processing, and quantitation with molecular/analytical methods (e.g., polymerase chain reaction [PCR], liquid chromatography mass spectrophotometry [LC‐MS], high‐performance liquid chromatography [HPLC], sodium dodecyl sulfate polyacrylamide gel electrophoresis [SDS‐PAGE]), and immunofluorescent macroscopy among others. Machine learning algorithms can discover patterns in the data generated from analytical methods and make predictions.
Figure 2
Figure 2
Pathophysiology of Type II diabetes. Alpha and beta cells of the pancreas secret insulin and glucagon, respectively. These two hormones interact with the liver to regulate blood sugar levels. While glucagon stimulates glycogenolysis and gluconeogenesis, leading to a rise in sugar levels in the blood, insulin promotes glycogenesis and glucose uptake in skeletal muscles and other tissues. In diabetes, there is either impaired insulin secretion or insulin resistance, resulting in elevated plasma concentration of sugar.
Figure 3
Figure 3
Pathophysiology of chronic kidney disease. Kidney injury results in loss of nephrons and increased levels of angiotensin II. Angiotensin II is a vasoconstrictor that triggers increased blood pressure. Alongside hypertension, there is an increase in glomerular permeability, tubular protein reabsorption, tubular/interstitial inflammation, and eventually renal scarring.
Figure 4
Figure 4
Different types of cardiovascular diseases.
Figure 5
Figure 5
Principal component analysis (PCA) (with 95% confidence ellipses) showcases the clustering patterns and reasonable discriminatory power for samples with congenital heart defects (case) and control using the top 20 expressive genes identified via RF based on orthogonal linear combinations of the features.
Figure 6
Figure 6
The metastatic cascade. The spread of a tumor is characterized by a sequence of events. These events are local invasion, intravasation, circulation through the vasculature, extravasation, formation of micrometastasis, and colonization.
Figure 7
Figure 7
Response to inflammation. The components of inflammation include vascular changes―vasodilation that results in increased blood flow and increased vascular permeability that allows fluids to reach the infected site. Immune cells, including macrophages and leukocytes, are also activated to phagocytose infectious agents. Monocytes are also recruited, which become macrophages. These macrophages can differentiate into M1 and M2 when stimulated by cytokines. M1 releases the anti‐bactericidal compound NO, whereas M2 macrophages promote wound healing and tissue repair. Protein systems, including the complement, coagulation, and kinin, are all activated inflammatory responses. Depending on the stimulus, T‐cells may be differentiated into TH1, TH2, and TH17, and all these are involved in various stages of inflammation.

References

    1. World Health Organisation (WHO) . Non‐Communicable Diseases , WHO; 2021. Accessed January 2, 2024. https://www.who.int/news-room/fact-sheets/detail/noncommunicable-diseases
    1. Adua E, Roberts P, Sakyi SA, et al. Profiling of cardio‐metabolic risk factors and medication utilisation among type II diabetes patients in Ghana: a prospective cohort study. Clin Transl Med. 2017;6:e32. 10.1186/s40169-017-0162-5 - DOI - PMC - PubMed
    1. Adua E, Kolog EA, Afrifa‐Yamoah E, et al. Predictive model and feature importance for early detection of type II diabetes mellitus. Transl Med Commun. 2021;6:17. 10.1186/s41231-021-00096-z - DOI
    1. Adua E, Afrifa‐Yamoah E, Kolog EA. Leveraging supervised machine learning for determining the link between suboptimal health status and the prognosis of chronic diseases. In: Wang W, ed., All Around Suboptimal Health. Advances in Predictive, Preventive and Personalised Medicine. Springer; 2024:18. 10.1007/978-3-031-46891-9_9 - DOI
    1. Golubnitschaja O, Baban B, Boniolo G, et al. Medicine in the early twenty‐first century: paradigm and anticipation‐EPMA position paper 2016. EPMA J. 2016;7:23. 10.1186/s13167-016-0072-4 - DOI - PMC - PubMed

LinkOut - more resources