Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2007 Aug 20:8:301.
doi: 10.1186/1471-2105-8-301.

Genome bioinformatic analysis of nonsynonymous SNPs

Affiliations

Genome bioinformatic analysis of nonsynonymous SNPs

David F Burke et al. BMC Bioinformatics. .

Abstract

Background: Genome-wide association studies of common diseases for common, low penetrance causal variants are underway. A proportion of these will alter protein sequences, the most common of which is the non-synonymous single nucleotide polymorphism (nsSNP). It would be an advantage if the functional effects of an nsSNP on protein structure and function could be predicted, both for the final identification process of a causal variant in a disease-associated chromosome region, and in further functional analyses of the nsSNP and its disease-associated protein.

Results: In the present report we have compared and contrasted structure- and sequence-based methods of prediction to over 5500 genes carrying nearly 24,000 nsSNPs, by employing an automatic comparative modelling procedure to build models for the genes. The nsSNP information came from two sources, the OMIM database which are rare (minor allele frequency, MAF, < 0.01) and are known to cause penetrant, monogenic diseases. Secondly, nsSNP information came from dbSNP125, for which the vast majority of nsSNPs, mostly MAF > 0.05, have no known link to a disease. For over 40% of the nsSNPs, structure-based methods predicted which of these sequence changes are likely to either disrupt the structure of the protein or interfere with the function or interactions of the protein. For the remaining 60%, we generated sequence-based predictions.

Conclusion: We show that, in general, the prediction tools are able distinguish disease causing mutations from those mutations which are thought to have a neutral affect. We give examples of mutations in genes that are predicted to be deleterious and may have a role in disease. Contrary to previous reports, we also show that rare mutations are consistently predicted to be deleterious as often as commonly occurring nsSNPs.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Distribution of fold recognition results. Distribution of the number of HOMSTRAD families predicted by FUGUE for a gene.
Figure 2
Figure 2
Model of IL21R/IL21 complex. Ribbon representation of the model of IL21R in complex with its ligand, IL21. The residue which is mutated, Arg191, is show is ball-and-stick representation.
Figure 3
Figure 3
Model of TCF7 in complex with DNA. Ribbon representation of the model of TCF7 in complex with DNA. The residue which is mutated, Trp336, is show is ball-and-stick representation. The tryptophan residue interacts with the major groove of the DNA.
Figure 4
Figure 4
Distribution of predicted functional residues. Pie chart showing percentage of mutations affecting predicted functional residues for OMIM and dbSNP
Figure 5
Figure 5
Distribution of predicted structurally deleterious nsSNPs. Pie chart showing percentage of structurally deleterious predictions for OMIM and dbSNP datasets.

Similar articles

Cited by

References

    1. Collins FS, Brooks LD, Chakravarti A. A DNA Polymorphism Discovery Resource for Research on Human Genetic Variation. Genome Research. 1998;8:1229–1231. - PubMed
    1. The International HapMap Consortium A haplotype map of the human genome. Nature. 2005. pp. 4371299–1320. - PMC - PubMed
    1. The Hapmap database http://www.hapmap.org
    1. Clayton DG, Walker NM, Smyth DJ, Pask R, Cooper JD, Maier LM, Smink LJ, Lam AC, Ovington NR, Stevens HE, Nutland S, Howson JM, Faham M, Moorhead M, Jones HB, Falkowski M, Hardenbol P, Willis TD, Todd JA. Population structure, differential bias and genomic control in a large-scale, case-control association study. Nat Genet. 2005;37:1243–6. doi: 10.1038/ng1653. - DOI - PubMed
    1. Fredman D, Siegfried M, Yuan YP, Bork P, Lehvaslaiho H, Brookes AJ. HGVbase: a human sequence variation database emphasizing data quality and a broad spectrum of data sources. Nucleic Acids Res. 2002;30:387–91. doi: 10.1093/nar/30.1.387. - DOI - PMC - PubMed

Publication types

MeSH terms

LinkOut - more resources