Identifying novel oncogenes: a machine learning approach
- PMID: 24402816
- DOI: 10.1007/s12539-013-0151-3
Abstract
Genome sequencing has flooded the databases with a huge amount of single nucleotide polymorphism (SNP) data. Although the number of detected SNPs is rising exponentially, characterization techniques still lag behind. Computational platforms that determine the pathogenicity associated with SNPs can provide a probable solution to this problem. To improve the prediction quality of SNP characterization methods, we implemented a machine learning support vector classification method. A total of 557 non-synonymous amino acid variants were collected from CENP family proteins, excluding CENPE. A multivariate simulation of the associated changes in biological phenomena was computed for each SNP using available SNP analysis platforms. A support vector model was designed using a training dataset, and the raw classification data were subjected to the classification hyperplane. We observed multiple lines of evidence for cancer-associated genetic mutations in the CENPI, CENPJ, CENPK, CENPL and CENPX proteins. The first four proteins have shown positive hits in the COSMIC database for mutations in tumour samples, but CENPX has not previously been reported for cancer-associated outcomes. Since CENPX has only recently been classified and little functional or pathological insight is yet available, the results obtained in this study will serve as a starting point for future cancer research on the CENPX protein.
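The abstract does not state which features, kernel, or software were used, so the following is only a minimal sketch of the general idea: train a support vector model on labelled variants, then place unlabelled variants on one side or the other of the resulting hyperplane. It assumes a linear kernel, trains by plain subgradient descent on the hinge loss, and uses invented two-score feature vectors; the data, the function names (`train_linear_svm`, `decision_value`, `predict`) and all parameter values are hypothetical and are not taken from the study.

```python
# Toy, made-up data: each row is one hypothetical variant described by two
# illustrative scores in [0, 1]. These are NOT values from the study.
X_train = [
    [0.90, 0.80], [0.80, 0.90], [0.85, 0.70], [0.70, 0.95],  # labelled +1
    [0.10, 0.20], [0.20, 0.10], [0.15, 0.30], [0.30, 0.05],  # labelled -1
]
y_train = [1, 1, 1, 1, -1, -1, -1, -1]


def decision_value(w, b, x):
    """Signed score w.x + b; its sign says which side of the hyperplane x is on."""
    return sum(wj * xj for wj, xj in zip(w, x)) + b


def train_linear_svm(X, y, lam=0.01, lr=0.05, epochs=3000):
    """Batch subgradient descent on the L2-regularized hinge loss (assumed setup)."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w = [lam * wj for wj in w]  # gradient of (lam / 2) * ||w||^2
        grad_b = 0.0
        for xi, yi in zip(X, y):
            if yi * decision_value(w, b, xi) < 1:  # point violates the margin
                for j in range(d):
                    grad_w[j] -= yi * xi[j] / n
                grad_b -= yi / n
        w = [wj - lr * gj for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b


def predict(w, b, x):
    """Return +1 or -1 depending on the side of the hyperplane."""
    return 1 if decision_value(w, b, x) >= 0 else -1


w, b = train_linear_svm(X_train, y_train)
new_variants = [[0.95, 0.90], [0.05, 0.10]]  # hypothetical unlabelled variants
for x in new_variants:
    print(x, predict(w, b, x))
```

In practice a library implementation with a tuned kernel and cross-validation would normally replace the hand-written trainer; the sketch only shows how a fitted hyperplane turns per-variant scores into a two-class call.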