Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Aug;8(8):816-26.
doi: 10.4161/epi.25430. Epub 2013 Jun 25.

Blood-based profiles of DNA methylation predict the underlying distribution of cell types: a validation analysis

Affiliations

Blood-based profiles of DNA methylation predict the underlying distribution of cell types: a validation analysis

Devin C Koestler et al. Epigenetics. 2013 Aug.

Abstract

The potential influence of underlying differences in relative leukocyte distributions in studies involving blood-based profiling of DNA methylation is well recognized and has prompted development of a set of statistical methods for inferring changes in the distribution of white blood cells using DNA methylation signatures. However, the extent to which this methodology can accurately predict cell-type proportions based on blood-derived DNA methylation data in a large-scale epigenome-wide association study (EWAS) has yet to be examined. We used publicly available data deposited in the Gene Expression Omnibus (GEO) database (accession number GSE37008), which consisted of both blood-derived epigenome-wide DNA methylation data assayed using the Illumina Infinium HumanMethylation27 BeadArray and complete blood cell (CBC) counts among a community cohort of 94 non-diseased individuals. Constrained projection (CP) was used to obtain predictions of the proportions of lymphocytes, monocytes and granulocytes for each of the study samples based on their DNA methylation signatures. Our findings demonstrated high consistency between the average CBC-derived and predicted percentage of monocytes and lymphocytes (17.9% and 17.6% for monocytes and 82.1% and 81.4% for lymphocytes), with root mean squared error (rMSE) of 5% and 6%, for monocytes and lymphocytes, respectively. Similarly, there was moderate-high correlation between the CP-predicted and CBC-derived percentages of monocytes and lymphocytes (0.60 and 0.61, respectively), and these results were robust to the number of leukocyte differentially methylated regions (L-DMRs) used for CP prediction. These results serve as further validation of the CP approach and highlight the promise of this technique for EWAS where DNA methylation is profiled using whole-blood genomic DNA.

Keywords: DNA methylation; cell mixture analysis; leukocytes; mixture deconvolution; whole-blood.

PubMed Disclaimer

Figures

None
Figure 1. Illustration of the blood cell mixture deconvolution approach. This approach involves, (A) constrained projection of DNA methylation profiles from a target methylation data set (S1) onto a reference data set (S0 ), which is comprised of the DNA methylation signatures for isolated white blood cell types (shapes reflect different white blood cell types). The result is an estimate of the underlying distribution of cell proportions (circle, triangle and hexagon) for each sample within S1. (B) This approach assumes that the methylation signature for samples within S1 are the weighted sum of the methylation signatures from individual white blood cell types, where the weights are proportional to the cell-type frequencies.
None
Figure 2. Complete blood cell (CBC) and predicted proportions of white blood cell types in the target methylation data set. CBC derived proportions [i.e., Ω(CBC)] of white blood cell types in (A) whole-blood and (B) peripheral blood mononuclear cell (PBMCs) (i.e., devoid of granulocytes) for the samples in the target methylation data set. (C) Predicted proportions (i.e., ) of CD8+ T-lymphocytes (CD8T), CD4+ T-lymphocytes (CD4T), Natural killer cells (NK), B cells (Bcell), Monocytes (Mono) and Granulocytes (Gran) for the target samples using constrained projection (CP). Black bars denote the median and the red dashed bars denote the 75th and 25th percentiles for the predicted cell-type proportions. Colored points indicate subjects with replicate samples, where two points of the same color denote replicate samples for the same subject.
None
Figure 3. Comparison of the predicted and CBC derived proportions of monocytes and lymphocytes among the target samples. Scatter-plot of the predicted and CBC-derived proportions of (A) monocytes and (B) lymphocytes. Solid red lines represent the unity lines (i.e., y = x). Bland–Altman plots for (C) monocyte and (D) lymphocyte proportions. Y-axes represent the difference in the predicted and CBC-derived cell-type proportions, and X-axes represent the mean cell-type proportions based on CP prediction and CBC-based proportions. Red-dotted lines indicate the global bootstrap-based 95% prediction intervals for the difference in predicted and CBC-derived cell-type proportions.
None
Figure 4. Prediction performance as a function of the number of L-DMRs used in CP. (A) Pearson correlation between the predicted and CBC-derived proportions of monocytes (blue line) and lymphocytes (red line) as a function of the numbers of L-DMRs used in CP. (B) root mean squared error (rMSE) for monocytes and lymphocytes and (C) median (%) granulocytes as a function of the numbers of L-DMRs used in CP. (D) Pearson correlation between the predicted and CBC-derived proportions of monocytes and lymphocytes as a function of the numbers of non L-DMRs (negative controls) used in CP.

References

    1. Rakyan VK, Down TA, Thorne NP, Flicek P, Kulesha E, Gräf S, et al. An integrated resource for genome-wide identification and analysis of human tissue-specific differentially methylated regions (tDMRs) Genome Res. 2008;18:1518–29. doi: 10.1101/gr.077479.108. - DOI - PMC - PubMed
    1. Christensen BC, Houseman EA, Marsit CJ, Zheng S, Wrensch MR, Wiemels JL, et al. Aging and environmental exposures alter tissue-specific DNA methylation dependent upon CpG island context. PLoS Genet. 2009;5:e1000602. doi: 10.1371/journal.pgen.1000602. - DOI - PMC - PubMed
    1. Baron U, Türbachova I, Hellwag A, Eckhardt F, Berlin K, Hoffmuller U, et al. DNA methylation analysis as a tool for cell typing. Epigenetics. 2006;1:55–60. doi: 10.4161/epi.1.1.2643. - DOI - PubMed
    1. Bocker MT, Hellwig I, Breiling A, Eckstein V, Ho AD, Lyko F. Genome-wide promoter DNA methylation dynamics of human hematopoietic progenitor cells during differentiation and aging. Blood. 2011;117:e182–9. doi: 10.1182/blood-2011-01-331926. - DOI - PubMed
    1. Ji H, Ehrlich LI, Seita J, Murakami P, Doi A, Lindau P, et al. Comprehensive methylome map of lineage commitment from haematopoietic progenitors. Nature. 2010;467:338–42. doi: 10.1038/nature09367. - DOI - PMC - PubMed

Publication types