Genome bioinformatic analysis of nonsynonymous SNPs
- PMID: 17708757
- PMCID: PMC1978506
- DOI: 10.1186/1471-2105-8-301
Genome bioinformatic analysis of nonsynonymous SNPs
Abstract
Background: Genome-wide association studies of common diseases for common, low penetrance causal variants are underway. A proportion of these will alter protein sequences, the most common of which is the non-synonymous single nucleotide polymorphism (nsSNP). It would be an advantage if the functional effects of an nsSNP on protein structure and function could be predicted, both for the final identification process of a causal variant in a disease-associated chromosome region, and in further functional analyses of the nsSNP and its disease-associated protein.
Results: In the present report we have compared and contrasted structure- and sequence-based methods of prediction to over 5500 genes carrying nearly 24,000 nsSNPs, by employing an automatic comparative modelling procedure to build models for the genes. The nsSNP information came from two sources, the OMIM database which are rare (minor allele frequency, MAF, < 0.01) and are known to cause penetrant, monogenic diseases. Secondly, nsSNP information came from dbSNP125, for which the vast majority of nsSNPs, mostly MAF > 0.05, have no known link to a disease. For over 40% of the nsSNPs, structure-based methods predicted which of these sequence changes are likely to either disrupt the structure of the protein or interfere with the function or interactions of the protein. For the remaining 60%, we generated sequence-based predictions.
Conclusion: We show that, in general, the prediction tools are able distinguish disease causing mutations from those mutations which are thought to have a neutral affect. We give examples of mutations in genes that are predicted to be deleterious and may have a role in disease. Contrary to previous reports, we also show that rare mutations are consistently predicted to be deleterious as often as commonly occurring nsSNPs.
Figures





Similar articles
-
Structure SNP (StSNP): a web server for mapping and modeling nsSNPs on protein structures with linkage to metabolic pathways.Nucleic Acids Res. 2007 Jul;35(Web Server issue):W384-92. doi: 10.1093/nar/gkm232. Epub 2007 May 30. Nucleic Acids Res. 2007. PMID: 17537826 Free PMC article.
-
Accurate prediction of deleterious protein kinase polymorphisms.Bioinformatics. 2007 Nov 1;23(21):2918-25. doi: 10.1093/bioinformatics/btm437. Epub 2007 Sep 12. Bioinformatics. 2007. PMID: 17855419
-
SNPeffect v2.0: a new step in investigating the molecular phenotypic effects of human non-synonymous SNPs.Bioinformatics. 2006 Sep 1;22(17):2183-5. doi: 10.1093/bioinformatics/btl348. Epub 2006 Jun 29. Bioinformatics. 2006. PMID: 16809394
-
Approaches and resources for prediction of the effects of non-synonymous single nucleotide polymorphism on protein function and interactions.Curr Pharm Biotechnol. 2008 Apr;9(2):123-33. doi: 10.2174/138920108783955164. Curr Pharm Biotechnol. 2008. PMID: 18393868 Review.
-
Bioinformatics approaches and resources for single nucleotide polymorphism functional analysis.Brief Bioinform. 2005 Mar;6(1):44-56. doi: 10.1093/bib/6.1.44. Brief Bioinform. 2005. PMID: 15826356 Review.
Cited by
-
Contrasted evolutionary histories of two Toll-like receptors (Tlr4 and Tlr7) in wild rodents (MURINAE).BMC Evol Biol. 2013 Sep 12;13:194. doi: 10.1186/1471-2148-13-194. BMC Evol Biol. 2013. PMID: 24028551 Free PMC article.
-
Deciphering Supramolecular Structures with Protein-Protein Interaction Network Modeling.Sci Rep. 2015 Nov 9;5:16341. doi: 10.1038/srep16341. Sci Rep. 2015. PMID: 26549015 Free PMC article.
-
Disease risk of missense mutations using structural inference from predicted function.Curr Protein Pept Sci. 2010 Nov;11(7):573-88. doi: 10.2174/138920310794109139. Curr Protein Pept Sci. 2010. PMID: 20887259 Free PMC article.
-
SySAP: a system-level predictor of deleterious single amino acid polymorphisms.Protein Cell. 2012 Jan;3(1):38-43. doi: 10.1007/s13238-011-1130-2. Epub 2011 Dec 19. Protein Cell. 2012. PMID: 22183811 Free PMC article.
-
Impact of genetic variation on three dimensional structure and function of proteins.PLoS One. 2017 Mar 15;12(3):e0171355. doi: 10.1371/journal.pone.0171355. eCollection 2017. PLoS One. 2017. PMID: 28296894 Free PMC article.
References
-
- Collins FS, Brooks LD, Chakravarti A. A DNA Polymorphism Discovery Resource for Research on Human Genetic Variation. Genome Research. 1998;8:1229–1231. - PubMed
-
- The Hapmap database http://www.hapmap.org
-
- Clayton DG, Walker NM, Smyth DJ, Pask R, Cooper JD, Maier LM, Smink LJ, Lam AC, Ovington NR, Stevens HE, Nutland S, Howson JM, Faham M, Moorhead M, Jones HB, Falkowski M, Hardenbol P, Willis TD, Todd JA. Population structure, differential bias and genomic control in a large-scale, case-control association study. Nat Genet. 2005;37:1243–6. doi: 10.1038/ng1653. - DOI - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources