Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jan 16;5(1):101349.
doi: 10.1016/j.xcrm.2023.101349. Epub 2023 Dec 20.

Multi-modal cell-free DNA genomic and fragmentomic patterns enhance cancer survival and recurrence analysis

Affiliations

Multi-modal cell-free DNA genomic and fragmentomic patterns enhance cancer survival and recurrence analysis

Norbert Moldovan et al. Cell Rep Med. .

Abstract

The structure of cell-free DNA (cfDNA) is altered in the blood of patients with cancer. From whole-genome sequencing, we retrieve the cfDNA fragment-end composition using a new software (FrEIA [fragment end integrated analysis]), as well as the cfDNA size and tumor fraction in three independent cohorts (n = 925 cancer from >10 types and 321 control samples). At 95% specificity, we detect 72% cancer samples using at least one cfDNA measure, including 64% early-stage cancer (n = 220). cfDNA detection correlates with a shorter overall (p = 0.0086) and recurrence-free (p = 0.017) survival in patients with resectable esophageal adenocarcinoma. Integrating cfDNA measures with machine learning in an independent test set (n = 396 cancer, 90 controls) achieve a detection accuracy of 82% and area under the receiver operating characteristic curve of 0.96. In conclusion, harnessing the biological features of cfDNA can improve, at no extra cost, the diagnostic performance of liquid biopsies.

Keywords: cancer; cell-free DNA; fragmentomics; liquid biopsy; multi-modal; sequencing.

PubMed Disclaimer

Conflict of interest statement

Declaration of interests F.M. is co-inventor on multiple patents related to cfDNA analysis. Other co-authors have no relevant conflict of interests.

Figures

None
Graphical abstract
Figure 1
Figure 1
Measures of cfDNA biological features are altered in cancer (A) The number of cancer, nodule, and control samples, the biological signatures of cfDNA, and the extracted measures used in this study. (B) Log10 cancer/control fold changes (FCs) of the 5′ trinucleotide fragment end sequence proportions. Trinucleotides with a p < 0.01 and a log10FC below the 25th percentile (red) or above the 75th percentile (blue) are shown. (C) The log10FC of trinucleotides significantly altered (∗p < 0.01) in various cancer types pre-treatment. (D and E) The increase in (D) the FrEIA score and (E) the Gini diversity index by cancer type in pre-treatment samples. (F) Aberrant normalized size distribution of cfDNA fragments in pre-treatment cancer samples compared to control samples. The vertical dashed lines outline the size interval used to calculate the P20-150 measure. (G and H) The (G) P20-150 and (H) the ichorCNA TF increased by cancer type in pre-treatment samples. Bd, bile duct cancer; Br, breast cancer; Cr, colorectal cancer; Es, esophageal cancer; Ga, gastric cancer; Gl, glioblastoma; Lu, lung cancer; Ov, ovarian cancer; Pa, pancreatic cancer. Numbers below the cancer type abbreviation represent the sample count. Cancer types with less than 10 samples are in the “other” category. p values were calculated using two-sided Mann-Whitney U test: ns, not significant, ∗p < 0.05, ∗∗p < 0.01, ∗∗∗p < 0.005, ∗∗∗∗p < 0.001. When multiple hypotheses were tested, alpha values were adjusted using the Bonferroni method. #, mean passing the threshold of 3% tumor fraction. No biological or technical replicates were used.
Figure 2
Figure 2
cfDNA biological signatures from xenograft mouse models (n = 16) grafted with a human colorectal cancer cell line (A) The schematic of the workflow. Signatures were computed from the reads aligning to the human reference genome (GRCh38; graft). (B) Spearman correlation of trinucleotide fragment-end proportions from the patient-derived samples (cancer) and the xenograft-derived samples (graft). Trinucleotides with a p < 0.01 and a log10FC below the 25th percentile (red) or above the 75th percentile (blue) are shown, as computed from the patient-derived samples with a tumor fraction >10% (see Figure 1B). (C–E) The increase in (C) the FrEIA score, (D) the Gini diversity index, and (E) the proportion of short fragments (P20-150) of the graft (in magenta) compared to patient-derived control and cancer samples. No biological or technical replicates were used.
Figure 3
Figure 3
Correlation between cfDNA biological features and with physiological variables (A) Spearman correlation between cfDNA measures in pre-treatment cancer samples. (B) Spearman correlation of cfDNA biological variables with the mutant allele fraction, where available (n = 196 samples). (C) Spearman correlation between cfDNA measures in control samples. ns, not significant, other values p < 0.01. (D–F) Spearman correlation of age and (D) the FrEIA score, (E) the Gini diversity index, and (F) the P20-150 of controls. (G–I) The FrEIA score (G), the Gini diversity index (H), and the P20-150 (I) by gender of control individuals. p values were calculated using two-sided Mann-Whitney U test: ns, not significant, ∗∗∗∗p < 0.001. No biological or technical replicates were used.
Figure 4
Figure 4
Cancer detection and classification using cfDNA biological features (A) Receiver operating characteristic (ROC) curve of the detection performance of pre-treatment samples using distinct cfDNA measures individually or in combination (all metrics). The vertical dashed line marks 95% specificity. (B) The proportion of detected pre-treatment lung and esophageal adenocarcinoma samples by stage. The numbers below the stages represent the detection rate. (C) Detection rates by at least one of the cfDNA measure or by the MAF, where available, of pre-treatment samples (n = 196 samples). (D) Schematic representation of the machine learning approach. (E) ROC curve from predictions on an independent dataset of a logistic regression classifier based on individual or the combination of cfDNA measures. The vertical dashed line marks 95% specificity. (F) Prediction probabilities of the logistic regression classifier of pre-treatment lung and esophageal adenocarcinoma samples by stage. Samples above the detection threshold (the horizontal dashed line) are considered detected. C, controls; NC, nodules; I, stage I; II, stage II; III, stage III; IV, stage IV. Numbers below the stages represent detection rates. No biological or technical replicates were used.
Figure 5
Figure 5
cfDNA biological patterns enable monitoring and prediction of recurrence for esophageal carcinoma (A) Schematic representation of the clinical timeline and sampling of patients with EAC. (B) The change in FrEIA score between pre-CRT and post-CRT samples based on the pathological complete response (pCR). p values were calculated using two-sided Mann-Whitney U test: ns, not significant, ∗∗p < 0.01, ∗∗∗p < 0.005. (C) Clinical timeline of patients with EAC undergoing resection from the PERFECT subcohort (n = 33) centered around the time point of resection. EOT, end of treatment; CSD, cancer-specific death. (D) Kaplan-Meier curves of the recurrence-free survival probabilities for patients with EAC from the PERFECT subcohort from the postresection time point. Samples with one of the measures higher than the threshold were considered “detected” (FrEIA score: 2.54, Gini diversity index: 0.98, P20-150: 0.26, ichorCNA: 3% sensitivity threshold). p values were calculated using log-rank test statistics. Dashed lines show the median survival time. No biological or technical replicates were used.
Figure 6
Figure 6
Integrating cfDNA biological patterns improve survival prognostication (A) Kaplan-Meier curves of the survival probability for patients with EAC from the postsurgery time point. Dashed lines represent the median survival. (B) Kaplan-Meier curves of the survival probability for patients with lung cancer from the time of initial sampling. Samples with one of the measures higher than the threshold were considered “detected” (FrEIA score: 2.54, Gini diversity index: 0.98, P20-150: 0.26, ichorCNA: 3% sensitivity threshold). p values were calculated using log-rank test statistics. Dashed lines show the median survival time. No biological or technical replicates were used.

References

    1. Bettegowda C., Sausen M., Leary R.J., Kinde I., Wang Y., Agrawal N., Bartlett B.R., Wang H., Luber B., Alani R.M., et al. Detection of circulating tumor DNA in early- and late-stage human malignancies. Sci. Transl. Med. 2014;6:224ra24. - PMC - PubMed
    1. Garcia-Murillas I., Schiavon G., Weigelt B., Ng C., Hrebien S., Cutts R.J., Cheang M., Osin P., Nerurkar A., Kozarewa I., et al. Mutation tracking in circulating tumor DNA predicts relapse in early breast cancer. Sci. Transl. Med. 2015;7:302ra133. - PubMed
    1. Heitzer E., Auer M., Hoffmann E.M., Pichler M., Gasch C., Ulz P., Lax S., Waldispuehl-Geigl J., Mauermann O., Mohan S., et al. Establishment of tumor-specific copy number alterations from plasma DNA of patients with cancer. Int. J. Cancer. 2013;133:346–356. - PMC - PubMed
    1. Creemers A., Krausz S., Strijker M., van der Wel M.J., Soer E.C., Reinten R.J., Besselink M.G., Wilmink J.W., van de Vijver M.J., van Noesel C.J.M., et al. Clinical value of ctDNA in upper-GI cancers: A systematic review and meta-analysis. Biochim. Biophys. Acta Rev. Canc. 2017;1868:394–403. - PubMed
    1. Razavi P., Li B.T., Brown D.N., Jung B., Hubbell E., Shen R., Abida W., Juluru K., De Bruijn I., Hou C., et al. High-intensity sequencing reveals the sources of plasma circulating cell-free DNA variants. Nat. Med. 2019;25:1928–1937. - PMC - PubMed

Publication types