A machine-learning approach for nonalcoholic steatohepatitis susceptibility estimation
- PMID: 36367682
- DOI: 10.1007/s12664-022-01263-2
A machine-learning approach for nonalcoholic steatohepatitis susceptibility estimation
Abstract
Background: Nonalcoholic steatohepatitis (NASH), a severe form of nonalcoholic fatty liver disease, can lead to advanced liver damage and has become an increasingly prominent health problem worldwide. Predictive models for early identification of high-risk individuals could help identify preventive and interventional measures. Traditional epidemiological models with limited predictive power are based on statistical analysis. In the current study, a novel machine-learning approach was developed for individual NASH susceptibility prediction using candidate single nucleotide polymorphisms (SNPs).
Methods: A total of 245 NASH patients and 120 healthy individuals were included in the study. Single nucleotide polymorphism genotypes of candidate genes including two SNPs in the cytochrome P450 family 2 subfamily E member 1 (CYP2E1) gene (rs6413432, rs3813867), two SNPs in the glucokinase regulator (GCKR) gene (rs780094, rs1260326), rs738409 SNP in patatin-like phospholipase domain-containing 3 (PNPLA3), and gender parameters were used to develop models for identifying at-risk individuals. To predict the individual's susceptibility to NASH, nine different machine-learning models were constructed. These models involved two different feature selections including Chi-square, and support vector machine recursive feature elimination (SVM-RFE) and three classification algorithms including k-nearest neighbor (KNN), multi-layer perceptron (MLP), and random forest (RF). All nine machine-learning models were trained using 80% of both the NASH patients and the healthy controls data. The nine machine-learning models were then tested on 20% of both groups. The model's performance was compared for model accuracy, precision, sensitivity, and F measure.
Results: Among all nine machine-learning models, the KNN classifier with all features as input showed the highest performance with 86% F measure and 79% accuracy.
Conclusions: Machine learning based on genomic variety may be applicable for estimating an individual's susceptibility for developing NASH among high-risk groups with a high degree of accuracy, precision, and sensitivity.
Keywords: Algorithm; Artificial intelligence; Disease susceptibility; Fatty liver; Gene; Machine learning; Neural network model; Nonalcoholic fatty liver disease; Nonalcoholic steatohepatitis; Single nucleotide polymorphism; Support vector machine.
© 2022. Indian Society of Gastroenterology.
Similar articles
-
Genotypic variation in CYP2E1, GCKR, and PNPLA3 among nonalcoholic steatohepatitis patients of Turkish origin.Mol Biol Rep. 2024 Jul 23;51(1):845. doi: 10.1007/s11033-024-09787-w. Mol Biol Rep. 2024. PMID: 39042259
-
Interactions of a PPARGC1A Variant and a PNPLA3 Variant Affect Nonalcoholic Steatohepatitis in Severely Obese Taiwanese Patients.Medicine (Baltimore). 2016 Mar;95(12):e3120. doi: 10.1097/MD.0000000000003120. Medicine (Baltimore). 2016. PMID: 27015186 Free PMC article.
-
Association of PNPLA3 rs738409 G/C gene polymorphism with nonalcoholic fatty liver disease in children: a meta-analysis.BMC Med Genet. 2020 Aug 18;21(1):163. doi: 10.1186/s12881-020-01098-8. BMC Med Genet. 2020. PMID: 32811452 Free PMC article.
-
Association between patatin-like phospholipase domain containing 3 gene (PNPLA3) polymorphisms and nonalcoholic fatty liver disease: a HuGE review and meta-analysis.Sci Rep. 2015 Mar 20;5:9284. doi: 10.1038/srep09284. Sci Rep. 2015. PMID: 25791171 Free PMC article. Review.
-
The genetic backgrounds in nonalcoholic fatty liver disease.Clin J Gastroenterol. 2018 Apr;11(2):97-102. doi: 10.1007/s12328-018-0841-9. Epub 2018 Feb 28. Clin J Gastroenterol. 2018. PMID: 29492830 Review.
Cited by
-
Innovative approaches to metabolic dysfunction-associated steatohepatitis diagnosis and stratification.Noncoding RNA Res. 2024 Oct 11;10:206-222. doi: 10.1016/j.ncrna.2024.10.002. eCollection 2025 Feb. Noncoding RNA Res. 2024. PMID: 40248839 Free PMC article.
-
Genotypic variation in CYP2E1, GCKR, and PNPLA3 among nonalcoholic steatohepatitis patients of Turkish origin.Mol Biol Rep. 2024 Jul 23;51(1):845. doi: 10.1007/s11033-024-09787-w. Mol Biol Rep. 2024. PMID: 39042259
References
-
- Vespasiani-Gentilucci U, Gallo P, Dell'Unto C, Volpentesta M, Antonelli-Incalzi R, Picardi A. Promoting genetics in non-alcoholic fatty liver disease: combined risk score through polymorphisms and clinical variables. World J Gastroenterol. 2018;24:4835–45.
-
- Vilar-Gomez E, Chalasani N. Non-invasive assessment of non-alcoholic fatty liver disease: clinical prediction rules and blood-based biomarkers. J Hepatol. 2018;68:305–15.
-
- Anstee QM, Seth D, Day CP. Genetic factors that affect risk of alcoholic and nonalcoholic fatty liver disease. Gastroenterology. 2016;150:1728–44.e7.
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Medical