Predicting the insurgence of human genetic diseases associated to single point protein mutations with support vector machines and evolutionary information
- PMID: 16895930
- DOI: 10.1093/bioinformatics/btl423
Predicting the insurgence of human genetic diseases associated to single point protein mutations with support vector machines and evolutionary information
Abstract
Motivation: Human single nucleotide polymorphisms (SNPs) are the most frequent type of genetic variation in human population. One of the most important goals of SNP projects is to understand which human genotype variations are related to Mendelian and complex diseases. Great interest is focused on non-synonymous coding SNPs (nsSNPs) that are responsible of protein single point mutation. nsSNPs can be neutral or disease associated. It is known that the mutation of only one residue in a protein sequence can be related to a number of pathological conditions of dramatic social impact such as Alzheimer's, Parkinson's and Creutzfeldt-Jakob's diseases. The quality and completeness of presently available SNPs databases allows the application of machine learning techniques to predict the insurgence of human diseases due to single point protein mutation starting from the protein sequence.
Results: In this paper, we develop a method based on support vector machines (SVMs) that starting from the protein sequence information can predict whether a new phenotype derived from a nsSNP can be related to a genetic disease in humans. Using a dataset of 21 185 single point mutations, 61% of which are disease-related, out of 3587 proteins, we show that our predictor can reach more than 74% accuracy in the specific task of predicting whether a single point mutation can be disease related or not. Our method, although based on less information, outperforms other web-available predictors implementing different approaches.
Availability: A beta version of the web tool is available at http://gpcr.biocomp.unibo.it/cgi/predictors/PhD-SNP/PhD-SNP.cgi
Similar articles
-
Use of estimated evolutionary strength at the codon level improves the prediction of disease-related protein mutations in humans.Hum Mutat. 2008 Jan;29(1):198-204. doi: 10.1002/humu.20628. Hum Mutat. 2008. PMID: 17935148
-
Prediction of the phenotypic effects of non-synonymous single nucleotide polymorphisms using structural and evolutionary information.Bioinformatics. 2005 May 15;21(10):2185-90. doi: 10.1093/bioinformatics/bti365. Epub 2005 Mar 3. Bioinformatics. 2005. PMID: 15746281
-
Predicting protein stability changes from sequences using support vector machines.Bioinformatics. 2005 Sep 1;21 Suppl 2:ii54-8. doi: 10.1093/bioinformatics/bti1109. Bioinformatics. 2005. PMID: 16204125
-
Computational prediction of the effects of non-synonymous single nucleotide polymorphisms in human DNA repair genes.Neuroscience. 2007 Apr 14;145(4):1273-9. doi: 10.1016/j.neuroscience.2006.09.004. Epub 2006 Oct 19. Neuroscience. 2007. PMID: 17055652 Review.
-
Bioinformatics tools for single nucleotide polymorphism discovery and analysis.Ann N Y Acad Sci. 2004 May;1020:101-9. doi: 10.1196/annals.1310.011. Ann N Y Acad Sci. 2004. PMID: 15208187 Review.
Cited by
-
Computational and Structural Analysis to Assess the Pathogenicity of Bardet-Biedl Syndrome Related Missense Variants Identified in Bardet-Biedl Syndrome 10 Gene (BBS10).ACS Omega. 2022 Oct 12;7(42):37654-37662. doi: 10.1021/acsomega.2c04522. eCollection 2022 Oct 25. ACS Omega. 2022. PMID: 36312387 Free PMC article.
-
Predicted Molecular Effects of Sequence Variants Link to System Level of Disease.PLoS Comput Biol. 2016 Aug 18;12(8):e1005047. doi: 10.1371/journal.pcbi.1005047. eCollection 2016 Aug. PLoS Comput Biol. 2016. PMID: 27536940 Free PMC article.
-
Prediction of Functional Consequences of Missense Mutations in ANO4 Gene.Int J Mol Sci. 2021 Mar 8;22(5):2732. doi: 10.3390/ijms22052732. Int J Mol Sci. 2021. PMID: 33800471 Free PMC article.
-
Unraveling Extremely Damaging IRAK4 Variants and Their Potential Implications for IRAK4 Inhibitor Efficacy.J Pers Med. 2023 Nov 26;13(12):1648. doi: 10.3390/jpm13121648. J Pers Med. 2023. PMID: 38138875 Free PMC article.
-
In silico screening and molecular dynamics simulation of disease-associated nsSNP in TYRP1 gene and its structural consequences in OCA3.Biomed Res Int. 2013;2013:697051. doi: 10.1155/2013/697051. Epub 2013 Jun 19. Biomed Res Int. 2013. PMID: 23862152 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical
Molecular Biology Databases