A systematic evaluation of normalization methods and probe replicability using infinium EPIC methylation data
- PMID: 36906598
- PMCID: PMC10008016
- DOI: 10.1186/s13148-023-01459-z
A systematic evaluation of normalization methods and probe replicability using infinium EPIC methylation data
Abstract
Background: The Infinium EPIC array measures the methylation status of > 850,000 CpG sites. The EPIC BeadChip uses a two-array design: Infinium Type I and Type II probes. These probe types exhibit different technical characteristics which may confound analyses. Numerous normalization and pre-processing methods have been developed to reduce probe type bias as well as other issues such as background and dye bias.
Methods: This study evaluates the performance of various normalization methods using 16 replicated samples and three metrics: absolute beta-value difference, overlap of non-replicated CpGs between replicate pairs, and effect on beta-value distributions. Additionally, we carried out Pearson's correlation and intraclass correlation coefficient (ICC) analyses using both raw and SeSAMe 2 normalized data.
Results: The method we define as SeSAMe 2, which consists of the application of the regular SeSAMe pipeline with an additional round of QC, pOOBAH masking, was found to be the best performing normalization method, while quantile-based methods were found to be the worst performing methods. Whole-array Pearson's correlations were found to be high. However, in agreement with previous studies, a substantial proportion of the probes on the EPIC array showed poor reproducibility (ICC < 0.50). The majority of poor performing probes have beta values close to either 0 or 1, and relatively low standard deviations. These results suggest that probe reliability is largely the result of limited biological variation rather than technical measurement variation. Importantly, normalizing the data with SeSAMe 2 dramatically improved ICC estimates, with the proportion of probes with ICC values > 0.50 increasing from 45.18% (raw data) to 61.35% (SeSAMe 2).
Keywords: DNA methylation; ICC; Illumina EPIC array; Normalization; Reproducibility.
© 2023. The Author(s).
Conflict of interest statement
The authors declare no competing interests.
Figures



Similar articles
-
Systematic evaluation of DNA methylation age estimation with common preprocessing methods and the Infinium MethylationEPIC BeadChip array.Clin Epigenetics. 2018 Oct 16;10(1):123. doi: 10.1186/s13148-018-0556-2. Clin Epigenetics. 2018. PMID: 30326963 Free PMC article.
-
Complete pipeline for Infinium(®) Human Methylation 450K BeadChip data processing using subset quantile normalization for accurate DNA methylation estimation.Epigenomics. 2012 Jun;4(3):325-41. doi: 10.2217/epi.12.21. Epigenomics. 2012. PMID: 22690668
-
Correlation of Infinium HumanMethylation450K and MethylationEPIC BeadChip arrays in cartilage.Epigenetics. 2020 Jun-Jul;15(6-7):594-603. doi: 10.1080/15592294.2019.1700003. Epub 2019 Dec 13. Epigenetics. 2020. PMID: 31833794 Free PMC article.
-
Improved filtering of DNA methylation microarray data by detection p values and its impact on downstream analyses.Clin Epigenetics. 2019 Jan 24;11(1):15. doi: 10.1186/s13148-019-0615-3. Clin Epigenetics. 2019. PMID: 30678737 Free PMC article. Review.
-
Statistical approaches for the analysis of DNA methylation microarray data.Hum Genet. 2011 Jun;129(6):585-95. doi: 10.1007/s00439-011-0993-x. Epub 2011 Apr 26. Hum Genet. 2011. PMID: 21519831 Free PMC article. Review.
Cited by
-
A meta-analysis of immune-cell fractions at high resolution reveals novel associations with common phenotypes and health outcomes.Genome Med. 2023 Jul 31;15(1):59. doi: 10.1186/s13073-023-01211-5. Genome Med. 2023. PMID: 37525279 Free PMC article.
-
Critical evaluation of the reliability of DNA methylation probes on the Illumina MethylationEPIC BeadChip microarrays.Res Sq [Preprint]. 2023 Oct 17:rs.3.rs-3068938. doi: 10.21203/rs.3.rs-3068938/v2. Res Sq. 2023. Update in: Epigenetics. 2024 Dec;19(1):2333660. doi: 10.1080/15592294.2024.2333660. PMID: 37461726 Free PMC article. Updated. Preprint.
-
Critical evaluation of the reliability of DNA methylation probes on the Illumina MethylationEPIC v1.0 BeadChip microarrays.Epigenetics. 2024 Dec;19(1):2333660. doi: 10.1080/15592294.2024.2333660. Epub 2024 Apr 2. Epigenetics. 2024. PMID: 38564759 Free PMC article.
-
Epitranscriptomic analysis reveals clinical and molecular signatures in glioblastoma.Acta Neuropathol Commun. 2025 Apr 11;13(1):74. doi: 10.1186/s40478-025-01966-5. Acta Neuropathol Commun. 2025. PMID: 40217422 Free PMC article.
-
Fresh and frozen cardiac tissue are comparable in DNA methylation array β-values, but formalin-fixed, paraffin-embedded tissue may overestimate DNA methylation levels.Sci Rep. 2023 Sep 29;13(1):16381. doi: 10.1038/s41598-023-43788-2. Sci Rep. 2023. PMID: 37773256 Free PMC article.
References
-
- Zaimi I, Pei D, Koestler DC, Marsit CJ, De Vivo I, Tworoger SS, Shields AE, Kelsey KL, Michaud DS. Variation in DNA methylation of human blood over a 1-year period using the illumina MethylationEPIC array. Epigenetics. 2018;13(10–11):1056–1071. doi: 10.1080/15592294.2018.1530008. - DOI - PMC - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources