Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Aug 13;9(8):e104355.
doi: 10.1371/journal.pone.0104355. eCollection 2014.

Copy number variation in Thai population

Affiliations

Copy number variation in Thai population

Bhoom Suktitipat et al. PLoS One. .

Abstract

Copy number variation (CNV) is a major genetic polymorphism contributing to genetic diversity and human evolution. Clinical application of CNVs for diagnostic purposes largely depends on sufficient population CNV data for accurate interpretation. CNVs from general population in currently available databases help classify CNVs of uncertain clinical significance, and benign CNVs. Earlier studies of CNV distribution in several populations worldwide showed that a significant fraction of CNVs are population specific. In this study, we characterized and analyzed CNVs in 3,017 unrelated Thai individuals genotyped with the Illumina Human610, Illumina HumanOmniexpress, or Illumina HapMap550v3 platform. We employed hidden Markov model and circular binary segmentation methods to identify CNVs, extracted 23,458 CNVs consistently identified by both algorithms, and cataloged these high confident CNVs into our publicly available Thai CNV database. Analysis of CNVs in the Thai population identified a median of eight autosomal CNVs per individual. Most CNVs (96.73%) did not overlap with any known chromosomal imbalance syndromes documented in the DECIPHER database. When compared with CNVs in the 11 HapMap3 populations, CNVs found in the Thai population shared several characteristics with CNVs characterized in HapMap3. Common CNVs in Thais had similar frequencies to those in the HapMap3 populations, and all high frequency CNVs (>20%) found in Thai individuals could also be identified in HapMap3. The majorities of CNVs discovered in the Thai population, however, were of low frequency, or uniquely identified in Thais. When performing hierarchical clustering using CNV frequencies, the CNV data were clustered into Africans, Europeans, and Asians, in line with the clustering performed with single nucleotide polymorphism (SNP) data. As CNV data are specific to origin of population, our population-specific reference database will serve as a valuable addition to the existing resources for the investigation of clinical significance of CNVs in Thais and related ethnicities.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. CNV discovery in the Thai population.
a) Diagram showing Thai CNV discovery workflow; b) % overlap proportion of CNVs identified by both CNV Workshop and PennCNV based on CNV size (bp). The regions shaded in red correspond to CNVs exclusively discovered by CNV Workshop, while regions shaded in blue represent those jointly discovered by CNV Workshop and PennCNV.
Figure 2
Figure 2. CNV and CNVR comparison between the Thai and eleven HapMap3 populations.
a) Size distribution of the Thai CNVs and HapMap3 CNVs; b) Allele frequency spectrum of CNVs with frequency of at least 1% across the Thai and HapMap3 CNVRs; c) Degree of match between the Thai CNVRs and HapMap3 CNVRs with reference to allele frequency.
Figure 3
Figure 3. Hierarchical clustering analysis (HCA) of the 35 genes overlapping CNVs with statistically significantly different allele frequencies across HapMap3 populations as compared with Thais (permutation P-value <0.0002).
The color bar on the right shows the color codes assigned to each frequency range in percent.
Figure 4
Figure 4. Thai CNV database.
a) A screen-captured image of Thai CNV homepage (http://thaicnv.icbs.mahidol.ac.th/thaicnv/); b) An example of CNV search page. Red and blue lines indicate deletion and duplication CNVs, respectively. Arrowheads indicate the starting and ending genomic locations. Panel I - input panel; panel II - graphical view; panel III - table view.

Similar articles

Cited by

References

    1. Stankiewicz P, Lupski JR (2010) Structural variation in the human genome and its role in disease. Annu Rev Med 61: 437–455. - PubMed
    1. Grayton HM, Fernandes C, Rujescu D, Collier DA (2012) Copy number variations in neurodevelopmental disorders. Prog Neurobiol 99: 81–91. - PubMed
    1. Manning M, Hudgins L (2010) Array-based technology and recommendations for utilization in medical genetics practice for detection of chromosomal abnormalities. Genet Med 12: 742–745. - PMC - PubMed
    1. Firth HV, Richards SM, Bevan AP, Clayton S, Corpas M, et al. (2009) DECIPHER: Database of Chromosomal Imbalance and Phenotype in Humans Using Ensembl Resources. Am J Hum Genet 84: 524–533. - PMC - PubMed
    1. MacDonald JR, Ziman R, Yuen RK, Feuk L, Scherer SW (2014) The Database of Genomic Variants: a curated collection of structural variation in the human genome. Nucleic Acids Res 42: D986–992. - PMC - PubMed

Publication types