Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Jul 15;10(1):3113.
doi: 10.1038/s41467-019-10864-z.

Detection of cell-type-specific risk-CpG sites in epigenome-wide association studies

Affiliations

Detection of cell-type-specific risk-CpG sites in epigenome-wide association studies

Xiangyu Luo et al. Nat Commun. .

Abstract

In epigenome-wide association studies, the measured signals for each sample are a mixture of methylation profiles from different cell types. Current approaches to the association detection claim whether a cytosine-phosphate-guanine (CpG) site is associated with the phenotype or not at aggregate level and can suffer from low statistical power. Here, we propose a statistical method, HIgh REsolution (HIRE), which not only improves the power of association detection at aggregate level as compared to the existing methods but also enables the detection of risk-CpG sites for individual cell types.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
A simple cartoon illustration of the HIRE model with three cell types (K = 3) and two phenotypes (disease status and age; q = 2). a Data generation procedure for the observed methylation vector Oi for sample i (i = 1, …, n). In the top panel, Oi is the convolution of cell-type-specific methylation profiles ui with cellular compositions pi. Both ui and pi depend on the attributes of sample i. The bottom panel describes how sample i’s phenotypes affect ui via two phenotype-effect matrices B1 and B2. In B1 and B2, the white square represents zero, which indicates that the phenotype exerts no influence on the corresponding methylation level in ui. b Inputs and outputs of HIRE. We input the observed methylation matrix O, the phenotype data matrix X, and a predetermined cell type number K into HIRE, and HIRE outputs the estimates for the cellular compositions p^, the baseline methylation profiles μ^, the phenotype effects B^, and the penalized BIC value. In addition, HIRE tests whether there is any association between CpG site j and phenotype in cell type kH0:βjk=0 vs H1:βjk0—and provides the p-values
Fig. 2
Fig. 2
Association detection performance of HIRE and commonly used methods in the true alternative setting with K = 3 and n = 180. Source data are provided as a Source Data file. In all figures, red corresponds to HIRE; yellow indicates the unadjusted analysis; brown represents SVA; purple refers to RefFreeEWAS; dark blue indicates EWASher; and light blue corresponds to ReFACTor. a ROC curves of HIRE and commonly used methods. HIRE has the largest area under the curve among all of the methods. b True cell-type-specific association pattern with disease status for 10,000 simulated CpG sites; columns correspond to cell types, and the rows represent the CpG sites. Dark cells correspond to risk-CpG sites, and grey cells are CpG sites not associated with the disease status. c Detected cell-type-specific association pattern with disease status by HIRE. Darkness represents -log10(p-value) di The p-value density plots for association with disease status in the simulation dataset for d HIRE, e unadjusted analysis, f SVA, g RefFreeEWAS, h EWASHer, and i ReFACTor. jo The Q-Q plots for association with disease status for j HIRE, k unadjusted analysis, l SVA, m RefFreeEWAS, n EWASHer, and o ReFACTor
Fig. 3
Fig. 3
Application of HIRE and commonly used methods to two real methylation datasets: RA and GALA II. Source data are provided as a Source Data file. a Cell-type-specic association pattern with RA status detected by HIRE in the RA dataset. Darkness represents the −log10(p−value). b Cell-type-specic association pattern with gender detected by HIRE in the GALA II dataset. The darkness represents the −log10(p−value). ch The p-value density plots for association with RA status in the RA dataset for c HIRE, d unadjusted analysis, e SVA, f RefFreeEWAS, g EWASHer, and h ReFACTor. i-n Q-Q plots for association with RA status in the RA dataset for i HIRE, j unadjusted analysis, k SVA, l RefFreeEWAS, m EWASHer, and n ReFACTor

Similar articles

  • DNA methylation in childhood asthma: an epigenome-wide meta-analysis.
    Xu CJ, Söderhäll C, Bustamante M, Baïz N, Gruzieva O, Gehring U, Mason D, Chatzi L, Basterrechea M, Llop S, Torrent M, Forastiere F, Fantini MP, Carlsen KCL, Haahtela T, Morin A, Kerkhof M, Merid SK, van Rijkom B, Jankipersadsing SA, Bonder MJ, Ballereau S, Vermeulen CJ, Aguirre-Gamboa R, de Jongste JC, Smit HA, Kumar A, Pershagen G, Guerra S, Garcia-Aymerich J, Greco D, Reinius L, McEachan RRC, Azad R, Hovland V, Mowinckel P, Alenius H, Fyhrquist N, Lemonnier N, Pellet J, Auffray C; BIOS Consortium; van der Vlies P, van Diemen CC, Li Y, Wijmenga C, Netea MG, Moffatt MF, Cookson WOCM, Anto JM, Bousquet J, Laatikainen T, Laprise C, Carlsen KH, Gori D, Porta D, Iñiguez C, Bilbao JR, Kogevinas M, Wright J, Brunekreef B, Kere J, Nawijn MC, Annesi-Maesano I, Sunyer J, Melén E, Koppelman GH. Xu CJ, et al. Lancet Respir Med. 2018 May;6(5):379-388. doi: 10.1016/S2213-2600(18)30052-3. Epub 2018 Feb 26. Lancet Respir Med. 2018. PMID: 29496485
  • Epigenome-wide association study of DNA methylation and adult asthma in the Agricultural Lung Health Study.
    Hoang TT, Sikdar S, Xu CJ, Lee MK, Cardwell J, Forno E, Imboden M, Jeong A, Madore AM, Qi C, Wang T, Bennett BD, Ward JM, Parks CG, Beane-Freeman LE, King D, Motsinger-Reif A, Umbach DM, Wyss AB, Schwartz DA, Celedón JC, Laprise C, Ober C, Probst-Hensch N, Yang IV, Koppelman GH, London SJ. Hoang TT, et al. Eur Respir J. 2020 Sep 3;56(3):2000217. doi: 10.1183/13993003.00217-2020. Print 2020 Sep. Eur Respir J. 2020. PMID: 32381493 Free PMC article.
  • Correcting for Sample Heterogeneity in Methylome-Wide Association Studies.
    Zou JY. Zou JY. Methods Mol Biol. 2017;1589:107-114. doi: 10.1007/7651_2015_266. Methods Mol Biol. 2017. PMID: 26246354
  • Epigenome-wide association studies in asthma: A systematic review.
    Edris A, den Dekker HT, Melén E, Lahousse L. Edris A, et al. Clin Exp Allergy. 2019 Jul;49(7):953-968. doi: 10.1111/cea.13403. Epub 2019 May 14. Clin Exp Allergy. 2019. PMID: 31009112
  • Recommendations for the design and analysis of epigenome-wide association studies.
    Michels KB, Binder AM, Dedeurwaerder S, Epstein CB, Greally JM, Gut I, Houseman EA, Izzi B, Kelsey KT, Meissner A, Milosavljevic A, Siegmund KD, Bock C, Irizarry RA. Michels KB, et al. Nat Methods. 2013 Oct;10(10):949-55. doi: 10.1038/nmeth.2632. Nat Methods. 2013. PMID: 24076989 Review.

Cited by

References

    1. Rakyan VK, Down TA, Balding DJ, Beck S. Epigenome-wide association studies for common human diseases. Nat. Rev. Genet. 2011;12:529–541. doi: 10.1038/nrg3000. - DOI - PMC - PubMed
    1. Verma M. Epigenome-wide association studies (EWAS) in cancer. Curr. Genom. 2012;13:308–313. doi: 10.2174/138920212800793294. - DOI - PMC - PubMed
    1. Liu Y, et al. Epigenome-wide association data implicate DNA methylation as an intermediary of genetic risk in rheumatoid arthritis. Nat. Biotechnol. 2013;31:142–147. doi: 10.1038/nbt.2487. - DOI - PMC - PubMed
    1. Gao X, Jia M, Zhang Y, Breitling LP, Brenner H. DNA methylation changes of whole blood cells in response to active smoking exposure in adults: a systematic review of DNA methylation studies. Clin. Epigenetics. 2015;7:113. doi: 10.1186/s13148-015-0148-3. - DOI - PMC - PubMed
    1. Joehanes R, et al. Epigenetic signatures of cigarette smoking. Circ. Cardiovasc. Genet. 2016;9:436–447. doi: 10.1161/CIRCGENETICS.116.001506. - DOI - PMC - PubMed

Publication types