Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Nov 16;44(20):e153.
doi: 10.1093/nar/gkw680. Epub 2016 Aug 2.

Differential peak calling of ChIP-seq signals with replicates with THOR

Affiliations

Differential peak calling of ChIP-seq signals with replicates with THOR

Manuel Allhoff et al. Nucleic Acids Res. .

Abstract

The study of changes in protein-DNA interactions measured by ChIP-seq on dynamic systems, such as cell differentiation, response to treatments or the comparison of healthy and diseased individuals, is still an open challenge. There are few computational methods comparing changes in ChIP-seq signals with replicates. Moreover, none of these previous approaches addresses ChIP-seq specific experimental artefacts arising from studies with biological replicates. We propose THOR, a Hidden Markov Model based approach, to detect differential peaks between pairs of biological conditions with replicates. THOR provides all pre- and post-processing steps required in ChIP-seq analyses. Moreover, we propose a novel normalization approach based on housekeeping genes to deal with cases where replicates have distinct signal-to-noise ratios. To evaluate differential peak calling methods, we delineate a methodology using both biological and simulated data. This includes an evaluation procedure that associates differential peaks with changes in gene expression as well as histone modifications close to these peaks. We evaluate THOR and seven competing methods on data sets with distinct characteristics from in vitro studies with technical replicates to clinical studies of cancer patients. Our evaluation analysis comprises of 13 comparisons between pairs of biological conditions. We show that THOR performs best in all scenarios.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
THOR's analysis workflow. (A) After pre-processing the ChIP-seq signal, normalization based on housekeeping genes (as shown) or TMM is performed. The normalized signal serves as input for the HMM which is used to estimate DPs. Post-processing includes the statistical assessment of DPs. (B) List of all competing methods categorized in one-stage and two-stage approaches. (C) Evaluation strategies. We evaluate DPC methods with biological data and the DCA statistic that is based on the association between DPs, gene expression and histone modifications. Moreover, simulated data are used to investigate the effect of distinct ChIP-seq signal characteristics on DPCs methods.
Figure 2.
Figure 2.
Results for simulated data. We show the AUC distribution for 25 repetitions of each scenario. Simulated data were based on (A) moderate and (B) high condition peak size variability and 2 (red lines) and 4 (green lines) replicates. Each boxplot is divided by the level of within condition variance (low, medium and high). Methods (x-axis) are ordered by decreasing median AUC values for the cases with 4 replicates.
Figure 3.
Figure 3.
Association between average FRIP and overdispersion scores α. (A–B) We show the relation between mean and variance of replicates on two selected experimental conditions (LYMP-CC) and (DC-cDC). (C) FRIP and overdispersion scores for the 26 biological conditions analysed: cocaine intake (CO), monocyte differentiation (MM), lymphoid cancer (LYMP) and dendritic cell differentiation (DC). Higher FRIP indicates higher signal-to-noise ratio and better ChIP-seq experiments. Higher overdispersion scores indicates higher within condition variability.
Figure 4.
Figure 4.
DCA curves for selected DP problems. We show DCA curves for peaks detected in H3K4me1 of the Cocaine Response study (CO-H3K4me1), which were evaluated by (A) RNA-seq and (B) H3K36me3; or H3K27ac peaks of the Lymphoma study (LYMP-FL-CC-H3K27ac), which were evaluated by (C) microarrays and (D) H3ac histone. Higher DCA values indicate higher association between differential peaks and differential expression or validating histones.
Figure 5.
Figure 5.
Example of differential peaks. We depict H3K4me3 and RNA-seq signals for monocytes (red) and macrophages (green) around the promoter of (A) IRAK3 and PDK2; (B) H3K27ac signals around CRCX4 for Follicular Lymphoma (FL) and control (CC) individuals; and (C) Pol2 signal around Dvl1 for mouse treated with cocaine (green) and saline (red); Below the ChIP-seq signals, we depict differential peaks of all evaluated methods. Methods that do not detect peaks for a given experiment are not listed.

References

    1. Dahl J.A., Collas P. MicroChIP–a rapid micro chromatin immunoprecipitation assay for small cell samples and biopsies. Nucleic Acids Res. 2008;36:e15. - PMC - PubMed
    1. Dunham I., Kundaje A., Aldred S. F., Collins P. J., Davis C. A, Doyle F., Lochovsky L. An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489:57–74. - PMC - PubMed
    1. Kundaje A., Meuleman W., Ernst J., Bilenky M., Yen A., Heravi-Moussavi A., Kellis M. Integrative analysis of 111 reference human epigenomes. Nature. 2015;518:317–330. - PMC - PubMed
    1. Martens J.H.A., Stunnenberg H.G. BLUEPRINT: mapping human blood cell epigenomes. Haematologica. 2013;98:1487–1489. - PMC - PubMed
    1. Lara-Astiaso D., Weiner A., Lorenzo-Vivas E., Zaretsky I., Jaitin D.A., David E., Keren-Shaul H., Mildner A., Winter D., Jung S., et al. Immunogenetics. Chromatin state dynamics during blood formation. Science. 2014;345:943–949. - PMC - PubMed