Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2004 Aug;14(8):1664-8.
doi: 10.1101/gr.2421604.

Large-scale validation of single nucleotide polymorphisms in gene regions

Affiliations

Large-scale validation of single nucleotide polymorphisms in gene regions

Matthew R Nelson et al. Genome Res. 2004 Aug.

Abstract

Genome-wide association studies using large numbers of bi-allelic single nucleotide polymorphisms (SNPs) have been proposed as a potentially powerful method for identifying genes involved in common diseases. To assemble a SNP collection appropriate for large-scale association, we designed assays for 226,099 publicly available SNPs located primarily within known and predicted gene regions. Allele frequencies were estimated in a sample of 92 CEPH Caucasians using chip-based MALDI-TOF mass spectrometry with pooled DNA. Of the 204,200 designed assays that were functional, 125,799 SNPs were determined to be polymorphic (minor allele frequency > 0.02), of which 101,729 map uniquely to the human genome. Many of the commonly available RefSNP annotations were predictive of polymorphic status and could be used to improve the selection of SNPs from the public domain for genetic research. The set of uniquely mapping, polymorphic SNPs is located within 10 kb of 66% of known and predicted genes annotated in LocusLink, which could prove useful for large-scale disease association studies.

PubMed Disclaimer

Figures

Figure 1
Figure 1
SNP allele frequency distribution. Minor allele frequency (MAF) distribution of 158,295 functional SNP assays that map uniquely to the human genome from this study (SQNM CEPH; histogram and solid line) compared to the distribution of 61,173 SNPs with Caucasian frequencies estimated by TSC (TSC CAUC; dashed line). The distributions are presented as densities, such that the area under each curve sums to one. The y-axis is truncated at 10, although the left-most histogram bar for SQNM CEPH extends to 17.
Figure 2
Figure 2
Correlation between frequencies from this study and matched Caucasian TSC SNP allele frequencies. The relationship between allele frequencies for 5,644 SNPs measured in the current study (SQNM CEPH) and matched Caucasian frequencies in the TSC collection (TSC CAUC) is compared with a bivariate contour plot. Density lines are plotted every 0.5 units, ranging from 0.5 for the outermost line to the highest density of 3.0 for the innermost line. The allele presented was arbitrarily standardized to the MassEXTEND high mass allele from each assay in this study. SNPs with minor allele frequencies <0.05 in both SQNM CEPH and TSC CAUC samples were excluded from this plot. Lower frequency SNPs are compared at the bottom of Table 1.

References

    1. Altshuler, D., Pollara, V.J., Cowles, C.R., Van Etten, W.J., Baldwin, J., Linton, L., and Lander, E.S. 2000. An SNP map of the human genome generated by reduced representation shotgun sequencing. Nature 407: 513–516. - PubMed
    1. Bansal, A., van den Boom, D., Kammerer, S., Honisch, C., Adam, G., Cantor, C.R., Kleyn, P., and Braun, A. 2002. Association testing by DNA pooling: An effective initial screen. Proc. Natl. Acad. Sci. 99: 16871–16874. - PMC - PubMed
    1. Buetow, K.H., Edmonson, M.N., and Cassidy, A.B. 1999. Reliable identification of large numbers of candidate SNPs from public EST data. Nat. Genet. 21: 323–325. - PubMed
    1. Buetow, K.H., Edmonson, M., MacDonald, R., Clifford, R., Yip, P., Kelley, J., Little, D.P., Strausberg, R., Koester, H., Cantor, C.R., et al. 2001. High-throughput development and characterization of a genomewide collection of gene-based single nucleotide polymorphism markers by chip-based matrix-assisted laser desorption/ionization time-of-flight mass spectrometry. Proc. Natl. Acad. Sci. 98: 581–584. - PMC - PubMed
    1. Carlson, C.S., Eberle, M.A., Rieder, M.J., Smith, J.D., Kruglyak, L., and Nickerson, D.A. 2003. Additional SNPs and linkage-disequilibrium analyses are necessary for whole-genome association studies in humans. Nat. Genet. 33: 518–521. - PubMed

WEB SITE REFERENCES

    1. http://www.ncbi.nlm.nih.gov/SNP/; NCBI dbSNP home page.
    1. http://snp.cshl.org; The SNP Consortium home page.

Publication types

MeSH terms

LinkOut - more resources