Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Jul 3;427(13):2271-89.
doi: 10.1016/j.jmb.2015.04.014. Epub 2015 May 1.

Genetic Basis of Common Human Disease: Insight into the Role of Missense SNPs from Genome-Wide Association Studies

Affiliations

Genetic Basis of Common Human Disease: Insight into the Role of Missense SNPs from Genome-Wide Association Studies

Lipika R Pal et al. J Mol Biol. .

Abstract

Recent genome-wide association studies (GWAS) have led to the reliable identification of single nucleotide polymorphisms (SNPs) at a number of loci associated with increased risk of specific common human diseases. Each such locus implicates multiple possible candidate SNPs for involvement in disease mechanism. A variety of mechanisms may link the presence of an SNP to altered in vivo gene product function and hence contribute to disease risk. Here, we report an analysis of the role of one of these mechanisms, missense SNPs (msSNPs) in proteins in seven complex trait diseases. Linkage disequilibrium information was used to identify possible candidate msSNPs associated with increased disease risk at each of 356 loci for the seven diseases. Two computational methods were used to estimate which of these SNPs has a significant impact on in vivo protein function. 69% of the loci have at least one candidate msSNP and 33% have at least one predicted high-impact msSNP. In some cases, these SNPs are in well-established disease-related proteins, such as MST1 (macrophage stimulating 1) for Crohn's disease. In others, they are in proteins identified by GWAS as likely candidates for disease relevance, but previously without known mechanism, such as ADAMTS13 (ADAM metallopeptidase with thrombospondin type 1 motif, 13) for coronary artery disease. In still other cases, the missense SNPs are in proteins not previously suggested as disease candidates, such as TUBB1 (tubulin, beta 1, class VI) for hypertension. Together, these data support a substantial role for this class of SNPs in susceptibility to common human disease.

Keywords: GWAS; complex trait disease; nonsynonymous; protein structure.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Fraction of loci containing putative high impact missense (red bars) and any missense SNPs (blue bars) for each of the seven diseases. The fraction of high impact SNPs varies widely.
Figure 2
Figure 2
Distribution of D′ and r2 for the high impact candidate missense SNPs. Most high impact SNPs have high D′ values, but small r2.
Figure 3
Figure 3
Distribution of odds ratio of risk alleles of high impact missense SNPs. Most odds ratios are reasonably small, with 51% less than 2.
Figure 4
Figure 4
Examples of proposed high impact missense SNPs involved in complex trait disease mechanism. (a) Crohn’s Disease candidate SNP in MST1 (Macrophage stimulating protein) (green chain), resulting in reduced binding affinity to the RON receptor (cyan) and so reduced macrophage signaling. The ARG 703 (yellow) substitution by CYS (purple) lies close to the protein-protein interface (modeled from the hetero complex, PDB ID: 4QT8). (b) Type 1 Diabetes and Rheumatoid Arthritis candidate SNP in PTPN22 (protein tyrosine phosphatase non-receptor type 22), weakening a protein-protein association, affecting T-cell activation. The ARG 620 (yellow) to TRP substitution in PTPN22 (cyan chain) introduces a steric clash (red disks) with W47 of the SH3 of Csk (c-src tyrosine protein kinase) (green chain) (from NMR model, PDB ID: 1JEG). (c) Coronary Artery Disease candidate SNP in ADAMTS13 (ADAM metallopeptidase with thrombospondin type 1 motif, 13), reducing the level of mature protein, and hence reducing cleavage of VWF (Von Willebrand factor) and downstream processes related to plaque formation. The PRO 618 (yellow) to ALA (purple) substitution lies in the β6–β7 loop of the spacer domain (green) and interacts with the CA domain (cyan) (modeled from PDB ID: 3GHM). (d) Type 1 Diabetes candidate SNP in IL7R (interleukin 7 receptor), putatively impairing interaction with IL7, and so affecting immune signaling. The HIS 165 (yellow) to GLU (purple) substitution affects a surface loop involved in interaction with the gamma chain of the signaling complex. (modeled from PDB ID: 3UP1). (e) Crohn’s Disease candidate SNP in PLCL1 (Phospholipase C-like 1). The VAL 667 (yellow) substitution to the bulkier ILE (purple) likely distorts the substrate binding site, affecting the catalytic efficiency (modeled from PDB ID: 1DJX). (f) Type 1 Diabetes candidate SNP in SIRPB1 (signal regulatory protein beta 1) involved in immune regulation. The ILE 229 (yellow) substitution by bulkier MET (purple) results in destabilization of the protein structure (PDB ID: 2WNG). The exact role of this protein immune regulation is not yet clear. (g) Hypertension candidate SNP in TUBB1 (tubulin beta 1, class VI) related to Coronary Syndrome and Myocardial Infarction. The GLU 43 (yellow) to PRO (purple) substitution results in predicted destabilization of the protein structure through loss of polar-polar residue interactions, expected to result in lower in vivo concentration (modeled from PDB ID: 3RYC). (h) Rheumatoid Arthritis candidate SNP in INHBC (inhibin, beta C), related to the serum uric acid level. Substitution of ARG 322 (yellow) by GLN (purple) in INHBC (inhibin, beta C) is predicted to destabilize the protein structure through loss of salt bridges (red broken lines) with a spatially close conserved aspartic acid, expected to result in lower in vivo concentration (modeled from PDB ID: 2ARV).
Figure 5
Figure 5
Workflow for the identification of candidate high impact msSNPs potentially involved in disease mechanism.

Similar articles

Cited by

References

    1. Jostins L, Ripke S, Weersma RK, Duerr RH, McGovern DP, Hui KY, et al. Host-microbe interactions have shaped the genetic architecture of inflammatory bowel disease. Nature. 2012;491:119–24. doi: 10.1038/nature11582. - DOI - PMC - PubMed
    1. dbSNP n.d. http://www.ncbi.nlm.nih.gov/projects/SNP/.

    1. Altshuler DM, Gibbs RA, Peltonen L, Dermitzakis E, Schaffner SF, Yu F, et al. Integrating common and rare genetic variation in diverse human populations. Nature. 2010;467:52–8. doi: 10.1038/nature09298. - DOI - PMC - PubMed
    1. Abecasis GR, Altshuler D, Auton A, Brooks LD, Durbin RM, Gibbs RA, et al. A map of human genome variation from population-scale sequencing. Nature. 2010;467:1061–73. doi: 10.1038/nature09534. - DOI - PMC - PubMed
    1. Kryukov GV, Pennacchio LA, Sunyaev SR. Most rare missense alleles are deleterious in humans: implications for complex disease and association studies. Am J Hum Genet. 2007;80:727–39. doi: 10.1086/513473. - DOI - PMC - PubMed

Publication types