Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Apr 7;45(6):e44.
doi: 10.1093/nar/gkw1193.

QSEA-modelling of genome-wide DNA methylation from sequencing enrichment experiments

Affiliations

QSEA-modelling of genome-wide DNA methylation from sequencing enrichment experiments

Matthias Lienhard et al. Nucleic Acids Res. .

Abstract

Genome-wide enrichment of methylated DNA followed by sequencing (MeDIP-seq) offers a reasonable compromise between experimental costs and genomic coverage. However, the computational analysis of these experiments is complex, and quantification of the enrichment signals in terms of absolute levels of methylation requires specific transformation. In this work, we present QSEA, Quantitative Sequence Enrichment Analysis, a comprehensive workflow for the modelling and subsequent quantification of MeDIP-seq data. As the central part of the workflow we have developed a Bayesian statistical model that transforms the enrichment read counts to absolute levels of methylation and, thus, enhances interpretability and facilitates comparison with other methylation assays. We suggest several calibration strategies for the critical parameters of the model, either using additional data or fairly general assumptions. By comparing the results with bisulfite sequencing (BS) validation data, we show the improvement of QSEA over existing methods. Additionally, we generated a clinically relevant benchmark data set consisting of methylation enrichment experiments (MeDIP-seq), BS-based validation experiments (Methyl-seq) as well as gene expression experiments (RNA-seq) derived from non-small cell lung cancer patients, and show that the workflow retrieves well-known lung tumour methylation markers that are causative for gene expression changes, demonstrating the applicability of QSEA for clinical studies. QSEA is implemented in R and available from the Bioconductor repository 3.4 (www.bioconductor.org/packages/qsea).

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Overview of the QSEA workflow. Green boxes represent data input, functions implemented in QSEA are depicted in blue, and red boxes describe the respective analysis step performed within these functions.
Figure 2.
Figure 2.
Modelling MeDIP enrichment. (A) MeDIP read density (depicted exemplary for normal sample from patient 3) has linear relation to methylation level, and (B) gets saturated for high CpG density. (C) CpG dependent enrichment profile plot shows heat-color coded density of MeDIP reads by CpG density, the mean MeDIP coverage (red line), the observed MeDIP coverage at fully methylated regions (black line), and the sigmoidal function fitted to the coverage of fully methylated regions (dashed green line). (D) Exemplary illustration of methylation estimates depending on MeDIP read density, assuming four background reads and 25 reads at fully methylated windows.
Figure 3.
Figure 3.
Benchmark and comparison to alternative methods. (A) Spearman correlation of QSEA, BayMeth and MeSiC estimates vs BS-seq for IMR-90 cell line. (B) High density scatterplots of MeDIP methylation estimates and 450k methylation levels for IMR 90 cell line. (C) Spearman correlation of QSEA, BayMeth and MeSiC estimates vs Methyl-Seq for chr1 of PDX samples. (D) Clustering of methylation differences between PDX and normal tissue for QSEA estimates.
Figure 4.
Figure 4.
Validation and functional interpretation of differentially methylated Regions: (A) Scatterplot of methylation difference from QSEA methylation estimate and Methyl-Seq of CpG island promoter DMRs. Blue: directly covered by Methyl-Seq. Orange: neighbourhood covered by Methyl-Seq, red: not covered by Methyl-Seq. (B) Methylation difference of 20 most hypermethylated tumour suppressor genes form QSEA estimates (red) and Methyl-Seq(blue). (C) Histogram of correlation between CpG island promoter methylation and gene expression. (D) Mean methylation difference versus gene expression log2 ratio of differentially expressed genes with differentially methylated CpG island promoter. Red numbers are gene counts per quadrant. (E) Gene expression in four NSCLC cell lines after demethylation validation experiment.
Figure 5.
Figure 5.
Enrichment of differentially methylated regions. (A) Sixteen most enriched transcription factor binding sites for hypermethylated regions and (B) hypomethylated regions respectively. (C) Tumour-Normal methylation differences chr22 show large hypomethylated blocks (LHB). Black dots represent mean methylation difference between PDX and normal, dashed lines are smoothed methylation differences for individual patients. Violet bars indicate two LHB, abundant in most of the tumour samples.

References

    1. Bird A. DNA methylation patterns and epigenetic memory. Genes Dev. 2002; 16:6–21. - PubMed
    1. Hanahan D., Weinberg R.A.. Hallmarks of cancer: the next generation. Cell. 2011; 144:646–674. - PubMed
    1. Esteller M. CpG island hypermethylation and tumor suppressor genes: a booming present, a brighter future. Oncogene. 2002; 21:5427–5440. - PubMed
    1. deVos T., Tetzner R., Model F., Weiss G., Schuster M., Distler J., Steiger K.V., Grutzmann R., Pilarsky C., Habermann J.K. et al. . Circulating methylated SEPT9 DNA in plasma is a biomarker for colorectal cancer. Clin. Chem. 2009; 55:1337–1346. - PubMed
    1. Heyn H., Esteller M.. DNA methylation profiling in the clinic: applications and challenges. Nat. Rev. Genet. 2012; 13:679–692. - PubMed

Publication types