The use of Gene Ontology terms and KEGG pathways for analysis and prediction of oncogenes
- PMID: 26801878
- DOI: 10.1016/j.bbagen.2016.01.012
The use of Gene Ontology terms and KEGG pathways for analysis and prediction of oncogenes
Abstract
Background: Oncogenes are a type of genes that have the potential to cause cancer. Most normal cells undergo programmed cell death, namely apoptosis, but activated oncogenes can help cells avoid apoptosis and survive. Thus, studying oncogenes is helpful for obtaining a good understanding of the formation and development of various types of cancers.
Methods: In this study, we proposed a computational method, called OPM, for investigating oncogenes from the view of Gene Ontology (GO) and biological pathways. All investigated genes, including validated oncogenes retrieved from some public databases and other genes that have not been reported to be oncogenes thus far, were encoded into numeric vectors according to the enrichment theory of GO terms and KEGG pathways. Some popular feature selection methods, minimum redundancy maximum relevance and incremental feature selection, and an advanced machine learning algorithm, random forest, were adopted to analyze the numeric vectors to extract key GO terms and KEGG pathways.
Results: Along with the oncogenes, GO terms and KEGG pathways were discussed in terms of their relevance in this study. Some important GO terms and KEGG pathways were extracted using feature selection methods and were confirmed to be highly related to oncogenes. Additionally, the importance of these terms and pathways in predicting oncogenes was further demonstrated by finding new putative oncogenes based on them.
Conclusions: This study investigated oncogenes based on GO terms and KEGG pathways. Some important GO terms and KEGG pathways were confirmed to be highly related to oncogenes. We hope that these GO terms and KEGG pathways can provide new insight for the study of oncogenes, particularly for building more effective prediction models to identify novel oncogenes. The program is available upon request.
General significance: We hope that the new findings listed in this study may provide a new insight for the investigation of oncogenes. This article is part of a Special Issue entitled "System Genetics" Guest Editor: Dr. Yudong Cai and Dr. Tao Huang.
Keywords: Gene Ontology; Incremental feature selection; KEGG pathway; Minimum redundancy maximum relevance; Oncogenes; Random forest.
Copyright © 2016. Published by Elsevier B.V.
Similar articles
-
Analysis of the chemical toxicity effects using the enrichment of Gene Ontology terms and KEGG pathways.Biochim Biophys Acta. 2016 Nov;1860(11 Pt B):2619-26. doi: 10.1016/j.bbagen.2016.05.015. Epub 2016 May 18. Biochim Biophys Acta. 2016. PMID: 27208425
-
Analysis of cancer-related lncRNAs using gene ontology and KEGG pathways.Artif Intell Med. 2017 Feb;76:27-36. doi: 10.1016/j.artmed.2017.02.001. Epub 2017 Feb 13. Artif Intell Med. 2017. PMID: 28363286
-
Analysis of Important Gene Ontology Terms and Biological Pathways Related to Pancreatic Cancer.Biomed Res Int. 2016;2016:7861274. doi: 10.1155/2016/7861274. Epub 2016 Nov 9. Biomed Res Int. 2016. PMID: 27957501 Free PMC article.
-
Identification of gene ontology and pathways implicated in suicide behavior: Systematic review and enrichment analysis of GWAS studies.Am J Med Genet B Neuropsychiatr Genet. 2019 Jul;180(5):320-329. doi: 10.1002/ajmg.b.32731. Epub 2019 May 2. Am J Med Genet B Neuropsychiatr Genet. 2019. PMID: 31045331
-
Deciphering hallmark processes of aging from interaction networks.Biochim Biophys Acta. 2016 Nov;1860(11 Pt B):2706-15. doi: 10.1016/j.bbagen.2016.07.017. Epub 2016 Jul 25. Biochim Biophys Acta. 2016. PMID: 27456767 Review.
Cited by
-
High throughput deep degradome sequencing reveals microRNAs and their targets in response to drought stress in mulberry (Morus alba).PLoS One. 2017 Feb 24;12(2):e0172883. doi: 10.1371/journal.pone.0172883. eCollection 2017. PLoS One. 2017. PMID: 28235056 Free PMC article.
-
Identification of differentially expressed protein-coding genes in lung adenocarcinomas.Exp Ther Med. 2020 Feb;19(2):1103-1111. doi: 10.3892/etm.2019.8300. Epub 2019 Dec 6. Exp Ther Med. 2020. PMID: 32010276 Free PMC article.
-
Integrative systems biology and in-vitro analysis of cryptolepine's therapeutic role in breast cancer.Discov Oncol. 2025 Aug 11;16(1):1520. doi: 10.1007/s12672-025-03158-y. Discov Oncol. 2025. PMID: 40784974 Free PMC article.
-
Transcriptome Analysis Suggests the Roles of Long Intergenic Non-coding RNAs in the Growth Performance of Weaned Piglets.Front Genet. 2019 Mar 18;10:196. doi: 10.3389/fgene.2019.00196. eCollection 2019. Front Genet. 2019. PMID: 30936891 Free PMC article.
-
Identification of hub genes, key miRNAs and potential molecular mechanisms of colorectal cancer.Oncol Rep. 2017 Oct;38(4):2043-2050. doi: 10.3892/or.2017.5930. Epub 2017 Aug 29. Oncol Rep. 2017. PMID: 28902367 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources