Estimands in epigenome-wide association studies
- PMID: 33926513
- PMCID: PMC8086103
- DOI: 10.1186/s13148-021-01083-9
Estimands in epigenome-wide association studies
Abstract
Background: In DNA methylation analyses like epigenome-wide association studies, effects in differentially methylated CpG sites are assessed. Two kinds of outcomes can be used for statistical analysis: Beta-values and M-values. M-values follow a normal distribution and help to detect differentially methylated CpG sites. As biological effect measures, differences of M-values are more or less meaningless. Beta-values are of more interest since they can be interpreted directly as differences in percentage of DNA methylation at a given CpG site, but they have poor statistical properties. Different frameworks are proposed for reporting estimands in DNA methylation analysis, relying on Beta-values, M-values, or both.
Results: We present and discuss four possible approaches of achieving estimands in DNA methylation analysis. In addition, we present the usage of M-values or Beta-values in the context of bioinformatical pipelines, which often demand a predefined outcome. We show the dependencies between the differences in M-values to differences in Beta-values in two data simulations: a analysis with and without confounder effect. Without present confounder effects, M-values can be used for the statistical analysis and Beta-values statistics for the reporting. If confounder effects exist, we demonstrate the deviations and correct the effects by the intercept method. Finally, we demonstrate the theoretical problem on two large human genome-wide DNA methylation datasets to verify the results.
Conclusions: The usage of M-values in the analysis of DNA methylation data will produce effect estimates, which cannot be biologically interpreted. The parallel usage of Beta-value statistics ignores possible confounder effects and can therefore not be recommended. Hence, if the differences in Beta-values are the focus of the study, the intercept method is recommendable. Hyper- or hypomethylated CpG sites must then be carefully evaluated. If an exploratory analysis of possible CpG sites is the aim of the study, M-values can be used for inference.
Keywords: DNA methylation; Epigenome-wide association study (EWAS); Estimands; Multiple testing; Reproducible research.
Conflict of interest statement
The authors declare that they have no competing interests.
Figures





Similar articles
-
Epigenome-wide association study of incident type 2 diabetes: a meta-analysis of five prospective European cohorts.Diabetologia. 2022 May;65(5):763-776. doi: 10.1007/s00125-022-05652-2. Epub 2022 Feb 15. Diabetologia. 2022. PMID: 35169870 Free PMC article.
-
Epigenome-wide association study of DNA methylation in panic disorder.Clin Epigenetics. 2017 Jan 21;9:6. doi: 10.1186/s13148-016-0307-1. eCollection 2017. Clin Epigenetics. 2017. PMID: 28149334 Free PMC article.
-
Identification of Diagnostic CpG Signatures in Patients with Gestational Diabetes Mellitus via Epigenome-Wide Association Study Integrated with Machine Learning.Biomed Res Int. 2021 May 19;2021:1984690. doi: 10.1155/2021/1984690. eCollection 2021. Biomed Res Int. 2021. PMID: 34104645 Free PMC article.
-
Ten Years of EWAS.Adv Sci (Weinh). 2021 Oct;8(20):e2100727. doi: 10.1002/advs.202100727. Epub 2021 Aug 11. Adv Sci (Weinh). 2021. PMID: 34382344 Free PMC article. Review.
-
Data Analysis of DNA Methylation Epigenome-Wide Association Studies (EWAS): A Guide to the Principles of Best Practice.Methods Mol Biol. 2022;2458:23-45. doi: 10.1007/978-1-0716-2140-0_2. Methods Mol Biol. 2022. PMID: 35103960 Review.
Cited by
-
External validation of integrated genetic-epigenetic biomarkers for predicting incident coronary heart disease.Epigenomics. 2021 Jul;13(14):1095-1112. doi: 10.2217/epi-2021-0123. Epub 2021 Jun 21. Epigenomics. 2021. PMID: 34148365 Free PMC article.
-
Time-varying mediation analysis for incomplete data with application to DNA methylation study for PTSD.bioRxiv [Preprint]. 2025 Mar 12:2024.02.06.579228. doi: 10.1101/2024.02.06.579228. bioRxiv. 2025. PMID: 40161631 Free PMC article. Preprint.
-
Digital methylation assessments of alcohol and cigarette consumption account for common variance in accelerated epigenetic ageing.Epigenetics. 2022 Dec;17(13):1991-2005. doi: 10.1080/15592294.2022.2100684. Epub 2022 Jul 22. Epigenetics. 2022. PMID: 35866695 Free PMC article.
-
Epigenetic biomarkers for smoking cessation.Addict Neurosci. 2023 Jun;6:100079. doi: 10.1016/j.addicn.2023.100079. Epub 2023 Mar 1. Addict Neurosci. 2023. PMID: 37123087 Free PMC article.
-
Transcriptomics and epigenetic data integration learning module on Google Cloud.Brief Bioinform. 2024 Jul 23;25(Supplement_1):bbae352. doi: 10.1093/bib/bbae352. Brief Bioinform. 2024. PMID: 39101486 Free PMC article.
References
-
- Herrel A, Joly D, Danchin E. Epigenetics in ecology and evolution. Hoboken: Wiley Online Library; 2020.
-
- Heiss JA, Brennan KJ, Baccarelli AA, Téllez-Rojo MM, Estrada-Gutiérrez G, Wright RO, Just AC. Battle of epigenetic proportions: comparing illumina’s epic methylation microarrays and truseq targeted bisulfite sequencing. Epigenetics. 2020;15(1–2):174–182. doi: 10.1080/15592294.2019.1656159. - DOI - PMC - PubMed
-
- Betensky RA. The p value requires context, not a threshold. Am Stat. 2019;73(sup1):115–117. doi: 10.1080/00031305.2018.1529624. - DOI
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials