Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2000 Oct;10(10):1532-45.
doi: 10.1101/gr.146900.

Sequence diversity and large-scale typing of SNPs in the human apolipoprotein E gene

Affiliations

Sequence diversity and large-scale typing of SNPs in the human apolipoprotein E gene

D A Nickerson et al. Genome Res. 2000 Oct.

Abstract

A common strategy for genotyping large samples begins with the characterization of human single nucleotide polymorphisms (SNPs) by sequencing candidate regions in a small sample for SNP discovery. This is usually followed by typing in a large sample those sites observed to vary in a smaller sample. We present results from a systematic investigation of variation at the human apolipoprotein E locus (APOE), as well as the evaluation of the two-tiered sampling strategy based on these data. We sequenced 5.5 kb spanning the entire APOE genomic region in a core sample of 72 individuals, including 24 each of African-Americans from Jackson, Mississippi; European-Americans from Rochester, Minnesota; and Europeans from North Karelia, Finland. This sequence survey detected 21 SNPs and 1 multiallelic indel, 14 of which had not been previously reported. Alleles varied in relative frequency among the populations, and 10 sites were polymorphic in only a single population sample. Oligonucleotide ligation assays (OLA) were developed for 20 of these sites (omitting the indel and a closely-linked SNP). These were then scored in 2179 individuals sampled from the same three populations (n = 843, 884, and 452, respectively). Relative allele frequencies were generally consistent with estimates from the core sample, although variation was found in some populations in the larger sample at SNPs that were monomorphic in the corresponding smaller core sample. Site variation in the larger samples showed no systematic deviation from Hardy-Weinberg expectation. The large OLA sample clearly showed that variation in many, but not all, of OLA-typed SNPs is significantly correlated with the classical protein-coding variants, implying that there may be important substructure within the classical epsilon 2, epsilon 3, and epsilon 4 alleles. Comparison of the levels and patterns of polymorphism in the core samples with those estimated for the OLA-typed samples shows how nucleotide diversity is underestimated when only a subset of sites are typed and underscores the importance of adequate population sampling at the polymorphism discovery stage. [The sequence data described in this paper have been submitted to the GenBank data library under accession no. AF261279.]

PubMed Disclaimer

Figures

Figure 1
Figure 1
Genomic structure and locations of single nucleotide polymorphisms in APOE. (a) The exon-intron structure. (b) The distribution of mapped protein-binding sites in the 5′ flanking sequence and first intron. (c) The types and distribution of repeat sequences. (d) DNA variants identified by sequencing 72 individuals. Coding-region variants are boxed. (e) Percentage identity plot for a comparison of human and mouse genomic sequences for APOE. The regions associated with the exons are indicated by number (1–4). The two regions associated with enhancer activities are indicated by letter (a or b).
Figure 2
Figure 2
Visual genotypes of APOE in the core data. Color codes represent the genotypes at each polymorphic site in each individual in the sample: blue for homozygous for the common allele, red for heterozygous, and yellow for homozygous for the rarer allele. (a) Jackson sample, (b) North Karelia sample, and (c) Rochester sample. Individuals homozygous across the locus are marked with an asterisk.
Figure 3
Figure 3
FST estimates for the OLA-typed single nucleotide polymorphisms (SNPs). The site-specific estimate for each of the 20 SNPs typed in the large epidemiological samples is shown, plotted relative to the genomic location of each site in the 5.5-kb sequenced region. The location of the APOE exons within the sequenced region, as presented in Fig. 1, are shown directly below the plot.

Similar articles

Cited by

References

    1. Artiga MJ, Bullido MJ, Frank A, Sastre I, Recuero M, Garcia MA, Lendon CL, Han SW, Morris JC, Vazquez J, et al. Risk for Alzheimer's disease correlates with transcriptional activity of the APOEgene. Hum Mol Genet. 1998a;7:1887–1892. - PubMed
    1. Artiga MJ, Bullido MJ, Sastre I, Recuero M, Gracia MA, Aldudo J, Vazquez J, Valdivieso F. Allelic polymorphisms in the transcriptional regulatory region of apolipoprotein E gene. FEBS Lett. 1998b;421:105–108. - PubMed
    1. Bowcock AM, Kidd JR, Mountain JL, Hebert JM, Carotenuto L, Kidd KK, Cavalli-Sforza LL. Drift, admixture, and selection in human evolution: A study with DNA polymorphisms. Proc Natl Acad Sci. 1991;88:839–843. - PMC - PubMed
    1. Bullido MJ, Artiga MJ, Recuero M, Sastre I, Garcia MA, Aldudo J, Lendon C, Han SW, Morris JC, Frank A, et al. A polymorphism in the regulatory region of APOE associated with risk for Alzheimer's dementia. Nat Genet. 1998;18:69–71. - PubMed
    1. Collins FS, Guyer MS, Chakravarti A. Variations on a theme: Cataloging human DNA sequence variation. Science. 1997;278:1580–1581. - PubMed

Publication types

Associated data

LinkOut - more resources