Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Jan 25;18(1):17.
doi: 10.1186/s13059-016-1143-5.

Estimating and accounting for tumor purity in the analysis of DNA methylation data from cancer studies

Affiliations

Estimating and accounting for tumor purity in the analysis of DNA methylation data from cancer studies

Xiaoqi Zheng et al. Genome Biol. .

Abstract

We present a set of statistical methods for the analysis of DNA methylation microarray data, which account for tumor purity. These methods are an extension of our previously developed method for purity estimation; our updated method is flexible, efficient, and does not require data from reference samples or matched normal controls. We also present a method for incorporating purity information for differential methylation analysis. In addition, we propose a control-free differential methylation calling method when normal controls are not available. Extensive analyses of TCGA data demonstrate that our methods provide accurate results. All methods are implemented in InfiniumPurify.

Keywords: Cancer epigenomics; DNA methylation; Differential methylation; Tumor purity.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
A flowchart to illustrate the InfiniumPurify algorithm, including purity estimation and DM calling procedures
Fig. 2
Fig. 2
Purity estimates from TCGA data. a Scatter plots showing comparison of purities for all TCGA tumor samples from InfiniumPurify with ESTIMATE, ABSOLUTE, LUMP, IHC, and CPE, respectively. b Correlations between InfiniumPurify and other estimates for all TCGA cancer types. c Distribution of estimated tumor purities from InfiniumPurify for all TCGA cancer types
Fig. 3
Fig. 3
Correlations between tumor purity and methylation levels are high for DMCs. a Distribution densities of correlations between tumor purity and methylation levels for all CpG sites, from observed and randomly sampled data, based on LUAD data. b Boxplots of correlations, stratified by rank-sum test statistics. c Boxplots of rank-sum test statistics, stratified by correlations
Fig. 4
Fig. 4
Differential methylation analysis results, with normal control. a Numbers of differential methylated CpG sites (FDR < 0.01). b Spatial correlations among test statistics from nearby CpG sites. c Average pan-cancer correlation of test statistics. d Enrichment p values for top 1000 differentially methylated genes within “PATHWAY-IN-CANCER” from KEGG
Fig. 5
Fig. 5
Differential methylation analysis, without normal control. a Examples of ROC curves from InfiniumPurify control-free DMC calling model, where results from tumor-normal comparison are treated as gold standard. b AUCs for selected cancer types in TCGA. c Heatmap showing overlaps of the top 50,000 DMCs among different cancer types

Similar articles

Cited by

References

    1. Jones PA. DNA methylation and cancer. Cancer Res. 1986;46:461–6. - PubMed
    1. Kulis M, Esteller M. DNA methylation and cancer. Adv Genet. 2010;70:27–56. - PubMed
    1. Laird PW. The power and the promise of DNA methylation markers. Nat Rev Cancer. 2003;3:253–66. doi: 10.1038/nrc1045. - DOI - PubMed
    1. Bibikova M, Barnes B, Tsan C, Ho V, Klotzle B, Le JM, et al. High density DNA methylation array with single CpG site resolution. Genomics. 2011;98:288–95. doi: 10.1016/j.ygeno.2011.07.007. - DOI - PubMed
    1. Lister R, Pelizzola M, Dowen RH, Hawkins RD, Hon G, Tonti-Filippini J, et al. Human DNA methylomes at base resolution show widespread epigenomic differences. Nature. 2009;462:315–22. doi: 10.1038/nature08514. - DOI - PMC - PubMed

Publication types