Hub gene identification and molecular subtype construction for Helicobacter pylori in gastric cancer via machine learning methods and NMF algorithm
- PMID: 37768204
- PMCID: PMC10683617
- DOI: 10.18632/aging.205053
Abstract
Helicobacter pylori (HP) is a Gram-negative, spiral-shaped bacterium that colonizes the human stomach and is a recognized risk factor for gastritis, peptic ulcer disease, and gastric cancer (GC). It has also been classified as a class I carcinogen that affects the occurrence and progression of GC by inducing various oncogenic pathways. Identifying HP-related key genes is therefore crucial for understanding these oncogenic mechanisms and improving the outcomes of GC patients. We retrieved HP-related gene sets from the Molecular Signatures Database. Based on the HP-related genes, we applied unsupervised non-negative matrix factorization (NMF) clustering to stratify TCGA-STAD, GSE15459, and GSE84433 samples into two clusters with distinct clinical outcomes and immune infiltration characteristics. Subsequently, two machine learning (ML) strategies, support vector machine-recursive feature elimination (SVM-RFE) and random forest (RF), were employed to determine twelve hub HP-related genes. Receiver operating characteristic and Kaplan-Meier curves further confirmed the diagnostic value and prognostic significance of the hub genes. Expression of the HP-related hub genes was then tested by qRT-PCR array and immunohistochemical images. Additionally, functional pathway enrichment analysis indicated that these hub genes are implicated in the genesis and progression of GC through activation or inhibition of classical cancer-associated pathways, such as epithelial-mesenchymal transition, cell cycle, apoptosis, and RAS/MAPK. In the present study, we constructed a novel HP-related tumor classification across different datasets and screened out twelve hub genes using ML algorithms, which may contribute to the molecular diagnosis and personalized therapy of GC.
Keywords: Helicobacter pylori; cluster; gastric cancer; hub genes; therapy.
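The two computational steps named in the abstract, NMF clustering of samples and hub gene screening with SVM-RFE and RF, can be illustrated with a minimal Python sketch using scikit-learn. This is not the authors' pipeline. The synthetic expression matrix, the gene names, the binary labels (for example tumor versus normal), the hyperparameters, and the choice to intersect the two gene lists are all assumptions made for illustration; the abstract does not specify them.

```python
import numpy as np
from sklearn.decomposition import NMF
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import RFE
from sklearn.svm import SVC


def nmf_clusters(expr, n_clusters=2, seed=0):
    """Assign each sample to the NMF component with its largest coefficient.

    expr: non-negative array of shape (n_samples, n_genes).
    """
    model = NMF(n_components=n_clusters, init="nndsvda",
                random_state=seed, max_iter=1000)
    sample_weights = model.fit_transform(expr)  # (n_samples, n_clusters)
    return sample_weights.argmax(axis=1)


def hub_genes(expr, labels, gene_names, n_keep=12, seed=0):
    """Return genes picked by both SVM-RFE and random forest importance.

    Intersecting the two lists is an assumption of this sketch.
    """
    # SVM-RFE: repeatedly drop the gene with the smallest linear SVM weight.
    rfe = RFE(SVC(kernel="linear"), n_features_to_select=n_keep, step=1)
    rfe.fit(expr, labels)
    svm_pick = {g for g, keep in zip(gene_names, rfe.support_) if keep}

    # Random forest: keep the n_keep genes with the highest importance.
    forest = RandomForestClassifier(n_estimators=200, random_state=seed)
    forest.fit(expr, labels)
    top = np.argsort(forest.feature_importances_)[::-1][:n_keep]
    rf_pick = {gene_names[i] for i in top}

    return sorted(svm_pick & rf_pick)


# Synthetic, hypothetical data: 60 samples, 30 genes, non-negative values.
rng = np.random.default_rng(0)
n_samples, n_genes = 60, 30
gene_names = [f"gene_{i}" for i in range(n_genes)]
labels = np.repeat([0, 1], n_samples // 2)       # assumed binary phenotype
expr = rng.gamma(shape=2.0, scale=1.0, size=(n_samples, n_genes))
expr[labels == 1, :5] += 3.0                     # five informative genes

clusters = nmf_clusters(expr)
hubs = hub_genes(expr, labels, gene_names, n_keep=12)
print("cluster sizes:", np.bincount(clusters, minlength=2))
print("candidate hub genes:", hubs)
```

With real data, expr would hold the expression of the HP-related genes in each cohort. The number of clusters is usually chosen by comparing several ranks rather than being fixed in advance, a step this sketch omits.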