Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Oct 22;5(10):e13574.
doi: 10.1371/journal.pone.0013574.

Non-synonymous and synonymous coding SNPs show similar likelihood and effect size of human disease association

Affiliations

Non-synonymous and synonymous coding SNPs show similar likelihood and effect size of human disease association

Rong Chen et al. PLoS One. .

Abstract

Many DNA variants have been identified on more than 300 diseases and traits using Genome-Wide Association Studies (GWASs). Some have been validated using deep sequencing, but many fewer have been validated functionally, primarily focused on non-synonymous coding SNPs (nsSNPs). It is an open question whether synonymous coding SNPs (sSNPs) and other non-coding SNPs can lead to as high odds ratios as nsSNPs. We conducted a broad survey across 21,429 disease-SNP associations curated from 2,113 publications studying human genetic association, and found that nsSNPs and sSNPs shared similar likelihood and effect size for disease association. The enrichment of disease-associated SNPs around the 80(th) base in the first introns might provide an effective way to prioritize intronic SNPs for functional studies. We further found that the likelihood of disease association was positively associated with the effect size across different types of SNPs, and SNPs in the 3' untranslated regions, such as the microRNA binding sites, might be under-investigated. Our results suggest that sSNPs are just as likely to be involved in disease mechanisms, so we recommend that sSNPs discovered from GWAS should also be examined with functional studies.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. A curated quantitative disease-SNP association database.
Starting from a list of all SNPs measured in the HapMap 3 project, we searched for their presence in all Medline abstracts, eliminating non-human studies. Significant SNP-disease associations were manually curated from the full text, and reviewed four rounds. SNP IDs were annotated using the UCSC genome browser for positions and function types and annotated using Entrez for associated genes. Disease mesh terms were compared with the Unified Medical Language System (UMLS) to select concept unique identifiers (CUIs).
Figure 2
Figure 2. Impact of SNP types on the likelihood of disease association.
For each of the nine SNP types, the likelihood of disease associations was estimated as the percentage of disease-associated SNPs among all SNPs measured in the HapMap 3 project. Similar likelihoods were observed among nsSNPs, sSNPs, and SNPs in the 5′-UTR. The number of disease-associated SNPs and SNPs measured in the HapMap 3 project were specified in the parenthesis.
Figure 3
Figure 3. Cumulative density of disease-associated SNPs in the first intron.
The cumulative densities of disease-associated SNPs were plotted against the distance from the first intron start site. The density was peaked at the 80th base from the first intron start site.
Figure 4
Figure 4. Impact of SNP types on the effect size of SNP-disease association.
The median odds ratio±standard error was calculated for each of the nine SNP types using 9,574 curated SNP-disease associations. The number of distinct SNPs was specified in the parenthesis. The odds ratios of nonsense SNPs were significantly higher than those of other SNP types (p<0.05, Mann-whiney U test). The p-values between the odds ratios of nsSNPs and other type of SNPs were shown in the figure.
Figure 5
Figure 5. SNPs in the 3′-UTR were under-investigated.
The median odds ratio of SNP-disease associations was plotted against the percentage of disease-associated SNPs for each of the nine SNP types. Assuming the median odds ratio will not be increased with more studies performed, the percentage of disease-associated SNPs will likely be increased to fit the curve.

References

    1. Stenson PD, Mort M, Ball EV, Howells K, Phillips AD, et al. The Human Gene Mutation Database: 2008 update. Genome Med. 2009;1:13. - PMC - PubMed
    1. Becker KG, Barnes KC, Bright TJ, Wang SA. The genetic association database. Nat Genet. 2004;36:431–432. - PubMed
    1. Amberger J, Bocchini CA, Scott AF, Hamosh A. McKusick's Online Mendelian Inheritance in Man (OMIM). Nucleic Acids Res. 2009;37:D793–796. - PMC - PubMed
    1. Johnson AD, O'Donnell CJ. An open access database of genome-wide association results. BMC Med Genet. 2009;10:6. - PMC - PubMed
    1. Hindorff LA, Sethupathy P, Junkins HA, Ramos EM, Mehta JP, et al. Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. Proc Natl Acad Sci U S A. 2009;106:9362–9367. - PMC - PubMed

Publication types

Substances