Integrated network analysis and machine learning approach for the identification of key genes of triple-negative breast cancer
- PMID: 30302816
- DOI: 10.1002/jcb.27903
Integrated network analysis and machine learning approach for the identification of key genes of triple-negative breast cancer
Abstract
Triple-negative breast cancer (TNBC) has attracted more attention compared with other breast cancer subtypes due to its aggressive nature, poor prognosis, and chemotherapy remains the mainstay of treatment with no other approved targeted therapy. Therefore, the study aimed to discover more promising therapeutic targets and investigating new insights of biological mechanism of TNBC. Six microarray data sets consisting of 463 non-TNBC and 405 TNBC samples were mined from Gene Expression Omnibus. The data sets were integrated by meta-analysis and identified 1075 differentially expressed genes. Protein-protein interaction network was constructed which consists of 486 nodes and 1932 edges, where 29 hub genes were obtained with high topological measures. Further, 16 features (hub genes), 12 upregulated (AURKB, CCNB2, CDC20, DDX18, EGFR, ENO1, MYC, NUP88, PLK1, PML, POLR2F, and SKP2) and four downregulated ( CCND1, GLI3, SKP1, and TGFB3) were selected through machine learning correlation based feature selection method on training data set. A naïve Bayes based classifier built using the expression profiles of 16 features (hub genes) accurately and reliably classify TNBC from non-TNBC samples in the validation test data set with a receiver operating curve of 0.93 to 0.98. Subsequently, Gene Ontology analysis revealed that the hub genes were enriched in mitotic cell cycle processes and Kyoto Encyclopedia of Genes and Genomes pathway analysis showed that they were enriched in cell cycle pathways. Thus, the identified key hub genes and pathways highlighted in the study would enhance the understanding of molecular mechanism of TNBC which may serve as potential therapeutic target.
Keywords: differentially expressed genes; protein-protein interaction network; receiver operating curve; triple-negative breast cancer.
© 2018 Wiley Periodicals, Inc.
Similar articles
-
Integrated analysis of differentially expressed genes and pathways in triple‑negative breast cancer.Mol Med Rep. 2017 Mar;15(3):1087-1094. doi: 10.3892/mmr.2017.6101. Epub 2017 Jan 4. Mol Med Rep. 2017. PMID: 28075450 Free PMC article.
-
Novel biomarkers identified in triple-negative breast cancer through RNA-sequencing.Clin Chim Acta. 2022 Jun 1;531:302-308. doi: 10.1016/j.cca.2022.04.990. Epub 2022 Apr 30. Clin Chim Acta. 2022. PMID: 35504321
-
Screening and Identification of Key Biomarkers in Inflammatory Breast Cancer Through Integrated Bioinformatic Analyses.Genet Test Mol Biomarkers. 2020 Aug;24(8):484-491. doi: 10.1089/gtmb.2020.0047. Epub 2020 Jun 27. Genet Test Mol Biomarkers. 2020. PMID: 32598242
-
High-throughput «Omics» technologies: New tools for the study of triple-negative breast cancer.Cancer Lett. 2016 Nov 1;382(1):77-85. doi: 10.1016/j.canlet.2016.03.001. Epub 2016 Mar 7. Cancer Lett. 2016. PMID: 26965997 Review.
-
Insights into Molecular Classifications of Triple-Negative Breast Cancer: Improving Patient Selection for Treatment.Cancer Discov. 2019 Feb;9(2):176-198. doi: 10.1158/2159-8290.CD-18-1177. Epub 2019 Jan 24. Cancer Discov. 2019. PMID: 30679171 Free PMC article. Review.
Cited by
-
Sex differences in colonic gene expression and fecal microbiota composition in a mouse model of obesity-associated colorectal cancer.Sci Rep. 2024 Feb 13;14(1):3576. doi: 10.1038/s41598-024-53861-z. Sci Rep. 2024. PMID: 38347027 Free PMC article.
-
Identification of Hub Genes Using Co-Expression Network Analysis in Breast Cancer as a Tool to Predict Different Stages.Med Sci Monit. 2019 Nov 23;25:8873-8890. doi: 10.12659/MSM.919046. Med Sci Monit. 2019. PMID: 31758680 Free PMC article.
-
Analysis of differentially expressed mRNAs and the prognosis of cholangiocarcinoma based on TCGA database.Transl Cancer Res. 2020 Aug;9(8):4739-4749. doi: 10.21037/tcr-20-812. Transl Cancer Res. 2020. PMID: 35117837 Free PMC article.
-
A genome-wide screen for human salicylic acid (SA)-binding proteins reveals targets through which SA may influence development of various diseases.Sci Rep. 2019 Sep 11;9(1):13084. doi: 10.1038/s41598-019-49234-6. Sci Rep. 2019. PMID: 31511554 Free PMC article.
-
Exploring Prognostic Gene Factors in Breast Cancer via Machine Learning.Biochem Genet. 2024 Dec;62(6):5022-5050. doi: 10.1007/s10528-024-10712-w. Epub 2024 Feb 21. Biochem Genet. 2024. PMID: 38383836
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Research Materials
Miscellaneous