Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Mar 11;15(1):41.
doi: 10.1186/s13148-023-01459-z.

A systematic evaluation of normalization methods and probe replicability using infinium EPIC methylation data

Affiliations

A systematic evaluation of normalization methods and probe replicability using infinium EPIC methylation data

H Welsh et al. Clin Epigenetics. .

Abstract

Background: The Infinium EPIC array measures the methylation status of > 850,000 CpG sites. The EPIC BeadChip uses a two-array design: Infinium Type I and Type II probes. These probe types exhibit different technical characteristics which may confound analyses. Numerous normalization and pre-processing methods have been developed to reduce probe type bias as well as other issues such as background and dye bias.

Methods: This study evaluates the performance of various normalization methods using 16 replicated samples and three metrics: absolute beta-value difference, overlap of non-replicated CpGs between replicate pairs, and effect on beta-value distributions. Additionally, we carried out Pearson's correlation and intraclass correlation coefficient (ICC) analyses using both raw and SeSAMe 2 normalized data.

Results: The method we define as SeSAMe 2, which consists of the application of the regular SeSAMe pipeline with an additional round of QC, pOOBAH masking, was found to be the best performing normalization method, while quantile-based methods were found to be the worst performing methods. Whole-array Pearson's correlations were found to be high. However, in agreement with previous studies, a substantial proportion of the probes on the EPIC array showed poor reproducibility (ICC < 0.50). The majority of poor performing probes have beta values close to either 0 or 1, and relatively low standard deviations. These results suggest that probe reliability is largely the result of limited biological variation rather than technical measurement variation. Importantly, normalizing the data with SeSAMe 2 dramatically improved ICC estimates, with the proportion of probes with ICC values > 0.50 increasing from 45.18% (raw data) to 61.35% (SeSAMe 2).

Keywords: DNA methylation; ICC; Illumina EPIC array; Normalization; Reproducibility.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
The normalization pipelines for all normalization methods considered
Fig. 2
Fig. 2
Boxplot of correlation between the replicate samples for each method, ordered by median
Fig. 3
Fig. 3
Box plot showing SD versus Pearson’s correlation based on SeSAMe 2 normalized beta values, using equidistant SD categories in the X axis

Similar articles

Cited by

References

    1. Moore LD, Le T, Fan G. DNA methylation and its basic function. Neuropsychopharmacol. 2013;28(1):23–38. doi: 10.1038/npp.2012.112. - DOI - PMC - PubMed
    1. Smith ZD, Meissner A. DNA methylation: roles inmammalian development. Nat Rev Genet. 2013;14(3):204–220. doi: 10.1038/nrg3354. - DOI - PubMed
    1. Zaimi I, Pei D, Koestler DC, Marsit CJ, De Vivo I, Tworoger SS, Shields AE, Kelsey KL, Michaud DS. Variation in DNA methylation of human blood over a 1-year period using the illumina MethylationEPIC array. Epigenetics. 2018;13(10–11):1056–1071. doi: 10.1080/15592294.2018.1530008. - DOI - PMC - PubMed
    1. Brunet A, Berger SL. Epigenetics of aging and aging-related disease. J Gerontol A Biol. 2014;69(S1):S17–S20. doi: 10.1093/gerona/glu042. - DOI - PMC - PubMed
    1. Johnson AA, Akman K, Calimport SR, Wuttke D, Stolzing A, De Magalhaes JP. The role of DNA methylation in aging, rejuvenation, and age-related disease. Rejuvenation Res. 2012;15(5):483–494. doi: 10.1089/rej.2012.1324. - DOI - PMC - PubMed

Publication types