PredictSNP: robust and accurate consensus classifier for prediction of disease-related mutations
- PMID: 24453961
- PMCID: PMC3894168
- DOI: 10.1371/journal.pcbi.1003440
PredictSNP: robust and accurate consensus classifier for prediction of disease-related mutations
Abstract
Single nucleotide variants represent a prevalent form of genetic variation. Mutations in the coding regions are frequently associated with the development of various genetic diseases. Computational tools for the prediction of the effects of mutations on protein function are very important for analysis of single nucleotide variants and their prioritization for experimental characterization. Many computational tools are already widely employed for this purpose. Unfortunately, their comparison and further improvement is hindered by large overlaps between the training datasets and benchmark datasets, which lead to biased and overly optimistic reported performances. In this study, we have constructed three independent datasets by removing all duplicities, inconsistencies and mutations previously used in the training of evaluated tools. The benchmark dataset containing over 43,000 mutations was employed for the unbiased evaluation of eight established prediction tools: MAPP, nsSNPAnalyzer, PANTHER, PhD-SNP, PolyPhen-1, PolyPhen-2, SIFT and SNAP. The six best performing tools were combined into a consensus classifier PredictSNP, resulting into significantly improved prediction performance, and at the same time returned results for all mutations, confirming that consensus prediction represents an accurate and robust alternative to the predictions delivered by individual tools. A user-friendly web interface enables easy access to all eight prediction tools, the consensus classifier PredictSNP and annotations from the Protein Mutant Database and the UniProt database. The web server and the datasets are freely available to the academic community at http://loschmidt.chemi.muni.cz/predictsnp.
Conflict of interest statement
The authors have declared that no competing interests exist.
Figures





Similar articles
-
PredictSNP2: A Unified Platform for Accurately Evaluating SNP Effects by Exploiting the Different Characteristics of Variants in Distinct Genomic Regions.PLoS Comput Biol. 2016 May 25;12(5):e1004962. doi: 10.1371/journal.pcbi.1004962. eCollection 2016 May. PLoS Comput Biol. 2016. PMID: 27224906 Free PMC article.
-
Performance of mutation pathogenicity prediction methods on missense variants.Hum Mutat. 2011 Apr;32(4):358-68. doi: 10.1002/humu.21445. Epub 2011 Feb 22. Hum Mutat. 2011. PMID: 21412949
-
Assessment of the predictive accuracy of five in silico prediction tools, alone or in combination, and two metaservers to classify long QT syndrome gene mutations.BMC Med Genet. 2015 May 13;16:34. doi: 10.1186/s12881-015-0176-z. BMC Med Genet. 2015. PMID: 25967940 Free PMC article.
-
Comparison and integration of computational methods for deleterious synonymous mutation prediction.Brief Bioinform. 2020 May 21;21(3):970-981. doi: 10.1093/bib/bbz047. Brief Bioinform. 2020. PMID: 31157880 Review.
-
DECIPHER: web-based, community resource for clinical interpretation of rare variants in developmental disorders.Hum Mol Genet. 2012 Oct 15;21(R1):R37-44. doi: 10.1093/hmg/dds362. Epub 2012 Sep 8. Hum Mol Genet. 2012. PMID: 22962312 Free PMC article. Review.
Cited by
-
Unraveling the potential effects of non-synonymous single nucleotide polymorphisms (nsSNPs) on the Protein structure and function of the human SLC30A8 gene on type 2 diabetes and colorectal cancer: An In silico approach.Heliyon. 2024 Aug 31;10(17):e37280. doi: 10.1016/j.heliyon.2024.e37280. eCollection 2024 Sep 15. Heliyon. 2024. PMID: 39296124 Free PMC article.
-
Molecular dynamics study of tropical calcific pancreatitis (TCP) associated calcium-sensing receptor single nucleotide variation.Front Mol Biosci. 2022 Oct 4;9:982831. doi: 10.3389/fmolb.2022.982831. eCollection 2022. Front Mol Biosci. 2022. PMID: 36275616 Free PMC article.
-
Antimicrobial Susceptibility Pattern of Helicobacter heilmannii and Helicobacter ailurogastricus Isolates.Microorganisms. 2020 Jun 25;8(6):957. doi: 10.3390/microorganisms8060957. Microorganisms. 2020. PMID: 32630563 Free PMC article.
-
Clinical relevance of short-chain acyl-CoA dehydrogenase (SCAD) deficiency: Exploring the role of new variants including the first SCAD-disease-causing allele carrying a synonymous mutation.BBA Clin. 2016 Mar 10;5:114-9. doi: 10.1016/j.bbacli.2016.03.004. eCollection 2016 Jun. BBA Clin. 2016. PMID: 27051597 Free PMC article.
-
EDA Missense Variant in a Cat with X-Linked Hypohidrotic Ectodermal Dysplasia.Genes (Basel). 2024 Jun 28;15(7):854. doi: 10.3390/genes15070854. Genes (Basel). 2024. PMID: 39062633 Free PMC article.
References
-
- Collins FS, Brooks LD, Chakravarti A (1998) A DNA polymorphism discovery resource for research on human genetic variation. Genome Res 8: 1229–1231 - PubMed
-
- Abecasis GR, Altshuler D, Auton A, Brooks LD, Durbin RM, et al. (2010) A map of human genome variation from population-scale sequencing. Nature 467: 1061–1073doi:10.1038/nature09534 - DOI - PMC - PubMed
-
- Collins FS, Guyer MS, Charkravarti A (1997) Variations on a theme: cataloging human DNA sequence variation. Science 278: 1580–1581 - PubMed
-
- Risch N, Merikangas K (1996) The future of genetic studies of complex human diseases. Science 273: 1516–1517 - PubMed
-
- Studer RA, Dessailly BH, Orengo CA (2013) Residue mutations and their impact on protein structure and function: detecting beneficial and pathogenic changes. Biochem J 449: 581–594doi:10.1042/BJ20121221 - DOI - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical
Molecular Biology Databases