Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2015 Jan 27;13(Suppl 7):77-83.
doi: 10.4137/CIN.S16345. eCollection 2014.

Hidden Markov Model-Based CNV Detection Algorithms for Illumina Genotyping Microarrays

Affiliations
Review

Hidden Markov Model-Based CNV Detection Algorithms for Illumina Genotyping Microarrays

Eric L Seiser et al. Cancer Inform. .

Abstract

Somatic alterations in DNA copy number have been well studied in numerous malignancies, yet the role of germline DNA copy number variation in cancer is still emerging. Genotyping microarrays generate allele-specific signal intensities to determine genotype, but may also be used to infer DNA copy number using additional computational approaches. Numerous tools have been developed to analyze Illumina genotype microarray data for copy number variant (CNV) discovery, although commonly utilized algorithms freely available to the public employ approaches based upon the use of hidden Markov models (HMMs). QuantiSNP, PennCNV, and GenoCN utilize HMMs with six copy number states but vary in how transition and emission probabilities are calculated. Performance of these CNV detection algorithms has been shown to be variable between both genotyping platforms and data sets, although HMM approaches generally outperform other current methods. Low sensitivity is prevalent with HMM-based algorithms, suggesting the need for continued improvement in CNV detection methodologies.

Keywords: copy number variation; genotyping microarray; hidden Markov model.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Size of CNVs detected in the HapMap samples using QuantiSNP, PennCNV, and GenoCN. The HMM-based CNV detection tools were applied to Illumina Human610-Quad BeadChip v1.0 data from three HapMap samples of European ancestry (NA06985, NA06991, and NA06993). For each sample, boxplots were generated for CNV sizes from each HMM-based detection method. Boxplots on the left are CNV sizes measured in genomic length (base pairs), and boxplots on the right are CNV sizes measured by the number of genotype microarray probes in the detected region.

Similar articles

Cited by

References

    1. Lenz G, Wright GW, Emre NC, et al. Molecular subtypes of diffuse large B-cell lymphoma arise by distinct genetic pathways. Proc Natl Acad Sci USA. 2008;105(36):13520–5. - PMC - PubMed
    1. Lee E-S, Kim L-H, Abdullah WA, Peh SC. Expression and alteration of p16 in diffuse large B cell lymphoma. Pathobiology. 2010;77(2):96–105. - PubMed
    1. Jardin F, Jais JP, Molina TJ, et al. Diffuse large B-cell lymphomas with CDKN2 A deletion have a distinct gene expression signature and a poor prognosis under R-CHOP treatment: a GELA study. Blood. 2010;116(7):1092–104. - PubMed
    1. Owens MA, Horten BC, Da Silva MM. HER2 amplification ratios by fluorescence in situ hybridization and correlation with immunohistochemistry in a cohort of 6556 breast cancer tissues. Clin Breast Cancer. 2004;5(1):63–9. - PubMed
    1. Park JW, Neve RM, Szollosi J, Benz CC. Unraveling the biologic and clinical complexities of HER2. Clin Breast Cancer. 2008;8(5):392–401. - PubMed