Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2019 Mar 25;20(2):585-597.
doi: 10.1093/bib/bby029.

Disease prediction by cell-free DNA methylation

Affiliations
Review

Disease prediction by cell-free DNA methylation

Hao Feng et al. Brief Bioinform. .

Abstract

Disease diagnosis using cell-free DNA (cfDNA) has been an active research field recently. Most existing approaches perform diagnosis based on the detection of sequence variants on cfDNA; thus, their applications are limited to diseases associated with high mutation rate such as cancer. Recent developments start to exploit the epigenetic information on cfDNA, which could have substantially wider applications. In this work, we provide thorough reviews and discussions on the statistical method developments and data analysis strategies for using cfDNA epigenetic profiles, in particular DNA methylation, to construct disease diagnostic models. We focus on two important aspects: marker selection and prediction model construction, under different scenarios. We perform simulations and real data analysis to compare different approaches, and provide recommendations for data analysis.

Keywords: DNA methylation; cell-free DNA; epigenetics; liquid biopsy; marker selection; predictive modeling.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Schematic overview of plasma cfDNA methylation mixing procedure and deconvolution methods for disease detection and monitoring. One straightforward approach is to use marker directly for disease detection. Besides using biomarkers directly, signal deconvolution methods can be categorized into either the ‘reference-free approach’ when the external tissue-specific methylation reference in unavailable, or the ‘reference-based approach’ when the tissue-specific profile is known.
Figure 2
Figure 2
Boxplot of classification accuracies for multiple methods in simulations. Marker: Marker directly predict approach. QP: using tissue proportions solved from QP procedure for prediction. NMF approach. True Prop: using simulated true proportion in classification. A total number of 20 simulations are conducted. (A) Low noise level; (B) medium noise level; (C) high noise level.
Figure 3
Figure 3
Scatterplots of NMF estimated reference methylation levels versus true reference methylation levels in four tissues. (A) Small intestines; (B) adipose tissues; (C) adrenal glands; (D) lungs. Relatively strong correlations are observed. Spearman’s correlation is shown in each panel.
Figure 4
Figure 4
Scatterplots of NMF estimated tissue proportions versus true tissue proportions in four tissues. (A) Small intestines; (B) adipose tissues; (C) adrenal glands; (D) lungs. Relatively strong correlations are observed. Spearman’s correlation is shown in each panel.
Figure 5
Figure 5
Barplot for the estimated 14 tissue proportions from real data for HCC patients, healthy controls and pregnant subjects, using QP with external reference available. HCC patients showed an increased proportion of cfDNA originating from liver, while pregnant controls showed an increased proportion of cfDNA originating from placenta.
Figure 6
Figure 6
Boxplot of real data solved tissue proportions for liver and placenta, respectively, among three groups. (A) Tissue proportions for liver among three groups; (B) tissue proportions for placenta among three groups.

References

    1. Sgouros G. Bone marrow dosimetry for radioimmunotherapy: theoretical considerations. J Nucl Med 1993;34(4):689–94. - PubMed
    1. Biomarkers Definitions Working Group. Biomarkers and surrogate endpoints: preferred definitions and conceptual framework. Clin Pharmacol Ther 2001;69:89–95. - PubMed
    1. Tibshirani R, Hastie T, Narasimhan B, et al.Diagnosis of multiple cancer types by shrunken centroids of gene expression. Proc Natl Acad Sci USA 2002;99(10):6567–72. - PMC - PubMed
    1. Parker JS, Mullins M, Cheang MC, et al.Supervised risk predictor of breast cancer based on intrinsic subtypes. J Clin Oncol 2009;27:1160–7. - PMC - PubMed
    1. Zilliox MJ, Irizarry RA.. A gene expression bar code for microarray data. Nat Methods 2007;4(11):911–13. - PMC - PubMed

Publication types