Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Mar 15;19(6):1001-8.
doi: 10.1093/hmg/ddp564. Epub 2009 Dec 21.

Copy number variations in East-Asian population and their evolutionary and functional implications

Affiliations

Copy number variations in East-Asian population and their evolutionary and functional implications

Seon-Hee Yim et al. Hum Mol Genet. .

Abstract

Recent discovery of the copy number variation (CNV) in normal individuals has widened our understanding of genomic variation. However, most of the reported CNVs have been identified in Caucasians, which may not be directly applicable to people of different ethnicities. To profile CNV in East-Asian population, we screened CNVs in 3578 healthy, unrelated Korean individuals, using the Affymetrix Genome-Wide Human SNP array 5.0. We identified 144,207 CNVs using a pooled data set of 100 randomly chosen Korean females as a reference. The average number of CNVs per genome was 40.3, which is higher than that of CNVs previously reported using lower resolution platforms. The median size of CNVs was 18.9 kb (range 0.2-5406 kb). Copy number losses were 4.7 times more frequent than copy number gains. CNV regions (CNVRs) were defined by merging overlapping CNVs identified in two or more samples. In total, 4003 CNVRs were defined encompassing 241.9 Mb accounting for approximately 8% of the human genome. A total of 2077 CNVRs (51.9%) were potentially novel. Known CNVRs were larger and more frequent than novel CNVRs. Sixteen percent of the CNVRs were observed in > or =1% of study subjects and 24% overlapped with the OMIM genes. A total of 476 (11.9%) CNVRs were associated with segmental duplications. CNVS/CNVRs identified in this study will be valuable resources for studying human genome diversity and its association with disease.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Size distribution of CNVs (A) and CNVRs (B) from this study with corresponding CNV/CNVRs from the DGV (August 2009 version). X-axis, the sizes in kilobases. Y-axis, the proportions of CNV/CNVRs within each size bin.
Figure 2.
Figure 2.
Examples of signal intensity ratio plots of CNVRs. (A) A CNVR on 2p12, gain only. (B) A CNVR on 10g21.1, loss only. (C) A CNVR on 4p11, gain/loss complex. X-axis, genomic coordinates (Mb). Y-axis, signal intensity ratios (test/reference) in log2 scale.
Figure 3.
Figure 3.
Degree of match between CNVRs from this study and DGV CNVRs with respect to the allele frequency. X-axis, the degree of match (%). Y-axis, the allele frequency of CNVRs.
Figure 4.
Figure 4.
Association of retrotransposons with CNVRs. (A) The mean regional fractions of the three retrotransposons around CNVRs by distance from the CNVRs. X-axis, the separating distance from the CNVRs. Y-axis, the fractions (%) of the retroelements. 95% confidence intervals are shown with error bars. (B) Divergence rates of the three retrotransposons around the CNVRs. X-axis, the separating distance from the CNVRs. Y-axis, the mean milliDivergence of the retroelements. 250 milliDivergence corresponds to 25.0% of divergence rates. The orientation of repeat sequences was determined with respect to the corresponding CNVRs; concordant elements are those whose tails are more closely located to CNVRs; divergent elements are those with opposite orientation. Calculation of milliDivergence was performed following the description at the RepeatMaskers (http://www.repeatmasker.org/).

Similar articles

Cited by

References

    1. Feuk L., Carson A.R., Scherer S.W. Structural variation in the human genome. Nat. Rev. Genet. 2006;7:85–97. - PubMed
    1. Estivill X., Armengol L. Copy number variants and common disorders: filling the gaps and exploring complexity in genome-wide association studies. PLoS Genet. 2007;3:1787–1799. - PMC - PubMed
    1. Iafrate A.J., Feuk L., Rivera M.N., Listewnik M.L., Donahoe P.K., Qi Y., Scherer S.W., Lee C. Detection of large-scale variation in the human genome. Nat. Genet. 2004;36:949–951. - PubMed
    1. Sebat J., Lakshmi B., Troge J., Alexander J., Young J., Lundin P., Månér S., Massa H., Walker M., Chi M., et al. Large-scale copy number polymorphism in the human genome. Science. 2004;305:525–528. - PubMed
    1. Simon-Sanchez J., Scholz S., Fung H.C., Matarin M., Hernandez D., Gibbs J.R., Britton A., de Vrieze F.W., Peckham E., Gwinn-Hardy K., et al. Genome-wide SNP assay reveals structural genomic variation, extended homozygosity and cell-line induced alterations in normal individuals. Hum. Mol. Genet. 2007;16:1–14. - PubMed

Publication types