Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2011 Jun;129(6):585-95.
doi: 10.1007/s00439-011-0993-x. Epub 2011 Apr 26.

Statistical approaches for the analysis of DNA methylation microarray data

Affiliations
Review

Statistical approaches for the analysis of DNA methylation microarray data

Kimberly D Siegmund. Hum Genet. 2011 Jun.

Abstract

Following the rapid development and adoption in DNA methylation microarray assays, we are now experiencing a growth in the number of statistical tools to analyze the resulting large-scale data sets. As is the case for other microarray applications, biases caused by technical issues are of concern. Some of these issues are old (e.g., two-color dye bias and probe- and array-specific effects), while others are new (e.g., fragment length bias and bisulfite conversion efficiency). Here, I highlight characteristics of DNA methylation that suggest standard statistical tools developed for other data types may not be directly suitable. I then describe the microarray technologies most commonly in use, along with the methods used for preprocessing and obtaining a summary measure. I finish with a section describing downstream analyses of the data, focusing on methods that model percentage DNA methylation as the outcome, and methods for integrating DNA methylation with gene expression or genotype data.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Three main approaches to DNA methylation microarray analysis. A) Black circles denote methylated CpGs and white circles denote unmethylated CpGs. B) Illumina’s bisulfite treatmentbased approach. Cy3/Cy5 labeling varies between Infinium I and Infinium II probes, C) Affinity enrichment using methylcytosine immunoprecipitation (IP) D) Methylation-sensitive restriction digestion using McrBC.

References

    1. Agius P, Campbell C. Bayesian Unsupervised Learning with Multiple Data Types Bayesian Unsupervised Learning with Multiple Data Types. Statistical Applications in Genetics and Molecular Biology. 2009;8 Article 27. - PubMed
    1. Aryee MJ, Wu Z, Ladd-Acosta C, Herb B, Feinberg AP, Yegnasubramanian S, Irizarry RA. Accurate genome-scale percentage DNA methylation estimates from microarray data. Biostatistics. 2010:1–14. - PMC - PubMed
    1. Bell JT, Pai AA, Pickrell JK, Gaffney DJ, Pique-Regi R, Degner JF, Gilad Y, Pritchard JK. DNA methylation patterns associate with genetic and gene expression variation in HapMap cell lines. Genome Biology. 2011;12:R10. - PMC - PubMed
    1. Bibikova M, Lin Z, Zhou L, Chudin E, Garcia EW, Wu B, Doucet D, Thomas NJ, Wang Y, Vollmer E, Goldmann T, Seifart C, Jiang W, Barker DL, Chee MS, Floros J, Fan J-B. High-throughput DNA methylation profiling using universal bead arrays. Genome Research. 2006;16:383–393. - PMC - PubMed
    1. Bird A. DNA methylation patterns and epigenetic memory. Genes & Development. 2002;16:6–21. - PubMed

Publication types