Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2024 Dec 12:rs.3.rs-5450972.
doi: 10.21203/rs.3.rs-5450972/v1.

Long-read epigenomic diagnosis and prognosis of Acute Myeloid Leukemia

Affiliations

Long-read epigenomic diagnosis and prognosis of Acute Myeloid Leukemia

Jatinder Lamba et al. Res Sq. .

Update in

  • Epigenomic diagnosis and prognosis of Acute Myeloid Leukemia.
    Marchi F, Shastri VM, Marrero RJ, Nguyen NHK, Öttl A, Schade AK, Landwehr M, Krali O, Nordlund J, Ghavami M, Sckaff F, Mansinghka VK, Cao X, Slayton W, Starostik P, Cogle CR, Ribeiro RC, Rubnitz JE, Klco J, Elsayed A, Gamis AS, Triche TJ Jr, Ries R, Kolb EA, Aplenc R, Alonzo T, Pounds S, Meshinchi S, Lamba JK. Marchi F, et al. Nat Commun. 2025 Jul 29;16(1):6961. doi: 10.1038/s41467-025-62005-4. Nat Commun. 2025. PMID: 40730747 Free PMC article.

Abstract

Acute Myeloid Leukemia (AML) is an aggressive cancer with dismal outcomes, vast subtype heterogeneity, and suboptimal risk stratification. In this study, we harmonized DNA methylation data from 3,314 patients across 11 cohorts to develop the Acute Leukemia Methylome Atlas (ALMA) of diagnostic relevance that predicted 27 WHO 2022 acute leukemia subtypes with an overall accuracy of 96.3% in discovery and 90.1% in validation cohorts. Specifically, for AML, we also developed AML Epigenomic Risk, a prognostic classifier of overall survival (OS) (HR=4.40; 95% CI=3.45-5.61; P<0.0001), and a targeted 38CpG AML signature using a stepwise EWAS-CoxPH-LASSO model predictive of OS (HR=3.84; 95% CI=3.01-4.91; P<0.0001). Finally, we developed a specimen-to-result protocol for simultaneous whole-genome and epigenome sequencing that accurately predicted diagnoses and prognoses from twelve prospectively collected patient samples using long-read sequencing. Our study unveils a new paradigm in acute leukemia management by leveraging DNA methylation for diagnostic and prognostic applications.

PubMed Disclaimer

Figures

Figure 1
Figure 1. The Acute Leukemia Methylome Atlas Study.
A) Overall design of computational models. B) Acute Leukemia Methylome Atlas (ALMA) defined by dimension reduction algorithm PaCMAP in the overall population. Each data point is a patient sample deriving from bone marrow or peripheral blood at diagnosis, relapse, or remission. As an unsupervised model, only 5mC values of 331,556 CpGs were used to define the clusters. C) Clinical implementation of long-read nanopore sequencing pipeline. AML, acute myeloid leukemia; ALL, acute lymphoblastic leukemia; MDS, myelodysplastic syndrome; MPAL, mixed phenotype acute leukemia; APL, acute promyelocytic leukemia; PaCMAP, pairwise controlled manifold approximation; WHO, World Health Organization; PB, peripheral blood; BM, bone marrow; EDTA, Ethylenediaminetetraacetic acid; HMW gDNA, high molecular weight genomic DNA.
Figure 2
Figure 2. ALMA Subtype classification according to WHO 2022 subtypes.
Sankey diagram displaying discovery cohort comparison between WHO 2022 diagnosis (left) and ALMA Subtype (right) for samples with A)or without B) WHO 2022 clinical annotation available. The width of the bands indicates the number of patient samples in each category. Comparison applied to validation cohort describing samples with C) or without D) WHO 2022 annotation available. E)Receiver operating characteristic (ROC) curves for the discovery cohort (n=2471) and F) validation cohort (bottom, n=200). The curves depict the true positive rate against the false positive rate for model predictions. Specific per-class AUC results are available in the electronic notebook. G)Table summarizing classification performance metrics: accuracy, macro F1, weighted F1, and Cohen’s Kappa for the training and test cohorts, indicating the overall predictive performance.
Figure 3
Figure 3. Patient outcomes and multivariate analyses of AML Epigenomic Risk groups.
Patient outcomes by AML Epigenomic Risk groups in A) discovery cohort and B) validation cohort with EFS (top) and OS (bottom) of AML Epigenomic Riskhigh (orange) and AML Epigenomic Risklow (blue). Multivariate analysis adjusting for other confounding variables of OS in C) discovery cohort and D) validation cohort and EFS in E) discovery cohort and F) validation cohort. MRD 1, minimal residual disease at end of first induction; FLT3 ITD, FMS-like tyrosine kinase-3 internal tandem duplication; CI, confidence interval; Ref., reference.
Figure 4
Figure 4. Development and testing of 38CpG AML signature.
A) Stepwise workflow for generating the 38CpG-AMLsignature model, including data preprocessing, batch correction, survival analysis, and penalized Cox PH modeling, resulting in a concise 5mC-based prognostic risk score for AML. B) Manhattan plot showing EWAS results with the significance of CpG probes across the genome, highlighting 200 CpGs that remained significant after p<10e-5 threshold. C) Stability analysis of 200 selected CpGs, showing the frequency of non-zero coefficients across 1000 model loops, which led to 38 CpGs being selected. D) Distribution of 38CpG-HazardScores in the discovery (left) and test (right) cohorts, dividing patients into high and low risk based on a 50% cutoff. E) Kaplan-Meier survival curves for EFS (top) and OS (bottom) in the discovery cohort (left, n=946) and F) validation cohort (right, n=200), comparing 38CpG-AMLsignaturehigh (orange) and 38CpG-AMLsignaturelow (blue) groups.
Figure 5
Figure 5. Forest plots of 38CpG AML signature against known confounding factors.
A) Discovery cohort by OS. B) Discovery cohort by EFS. C) Validation cohort by OS. D) Validation cohort by EFS.
Figure 6
Figure 6. Specimen-to-result prospective testing using rapid nanopore long-read sequencing.
A) Nanopore test cohort samples (n=12) mapped onto Acute Leukemia Methylome Atlas (ALMA) based on the combined methylation values of 331,556 CpGs. Each point represents a patient sample colored by ALMA Subtype. Samples with similar epigenomes cluster together, forming the island patterns seen through the scatterplot. Patient UF1829 was misplaced in two dimensions, but correctly placed in five dimensions. B) Sankey plot comparing clinical diagnosis information provided at sample collection (left) with ALMA Subtype classifier predictions (right) for the nanopore test cohort. The flow lines illustrate the correspondence between initial diagnostic classifications and the predicted ALMA subtypes. NUP98-fusion, Nucleoporin 98 fusion; PB, Peripheral Blood; BM, Bone Marrow; MDS, Myelodysplastic Syndrome; S/P: Status Post; Flow, Flow cytometry; TKD, Tyrosine Kinase Domain; CNS, Central Nervous System; BMBx, Bone Marrow Biopsy; NPM1, Nucleophosmin 1.

References

    1. Graff Z., Wachter F., Eapen M., Lehmann L. & Cooper T. Navigating Treatment Options and Communication in Relapsed Pediatric AML. Am Soc Clin Oncol Educ Book 44, e438690 (2024). - PubMed
    1. Lamba J K. et al. Dna methylation clusters and their relation to cytogenetic features in pediatric aml. Cancers (Basel) 12, 1–20 (2020). - PMC - PubMed
    1. Lamba J. K. et al. Integrated Epigenetic and Genetic Analysis Identifies Markers of Prognostic Significance in Pediatric Acute Myeloid Leukemia. Oncotarget vol. 9 www.oncotarget.com (2018). - PMC - PubMed
    1. Kral O. et al. Dna methylation signatures predict cytogenetic subtype and outcome in pediatric acute myeloid leukemia (Aml). Genes (Basel) 12, (2021). - PMC - PubMed
    1. Capper D. et al. DNA methylation-based classification of central nervous system tumours. Nature 555, 469–474 (2018). - PMC - PubMed

Publication types

LinkOut - more resources