Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jan 9;8(1):70-79.
doi: 10.1182/bloodadvances.2023011076.

Deep learning predicts therapy-relevant genetics in acute myeloid leukemia from Pappenheim-stained bone marrow smears

Affiliations

Deep learning predicts therapy-relevant genetics in acute myeloid leukemia from Pappenheim-stained bone marrow smears

Jacqueline Kockwelp et al. Blood Adv. .

Abstract

The detection of genetic aberrations is crucial for early therapy decisions in acute myeloid leukemia (AML) and recommended for all patients. Because genetic testing is expensive and time consuming, a need remains for cost-effective, fast, and broadly accessible tests to predict these aberrations in this aggressive malignancy. Here, we developed a novel fully automated end-to-end deep learning pipeline to predict genetic aberrations directly from single-cell images from scans of conventionally stained bone marrow smears already on the day of diagnosis. We used this pipeline to compile a multiterabyte data set of >2 000 000 single-cell images from diagnostic samples of 408 patients with AML. These images were then used to train convolutional neural networks for the prediction of various therapy-relevant genetic alterations. Moreover, we created a temporal test cohort data set of >444 000 single-cell images from further 71 patients with AML. We show that the models from our pipeline can significantly predict these subgroups with high areas under the curve of the receiver operating characteristic. Potential genotype-phenotype links were visualized with 2 different strategies. Our pipeline holds the potential to be used as a fast and inexpensive automated tool to screen patients with AML for therapy-relevant genetic aberrations directly from routine, conventionally stained bone marrow smears already on the day of diagnosis. It also creates a foundation to develop similar approaches for other bone marrow disorders in the future.

PubMed Disclaimer

Conflict of interest statement

Conflict-of-interest disclosure: The authors declare no competing financial interests.

Figures

None
Graphical abstract
Figure 1.
Figure 1.
Overview of our pipeline. (Left to right) Whole bone marrow smear scan images are processed using a CFM to automatically extract the most relevant single-cell crops, which are rudimentary classified into 4 cell classes. Subsequently, a GFEN is used to predict 5 genetic indicators that are important for first-line therapy decisions (CBFB::MYH11, MRC cytogenetics, FLT3mut, NPM1mut, and ELN 2017 favorable risk) based on appropriate single-cell images only. Additionally, visualization strategies were used to gain explainability with respect to the used deep learning models.
Figure 2.
Figure 2.
Randomly sampled example image for the classes of the CFM. (A) Images for the 4 classes of the general CFM and (B) of the eCFM are shown.
Figure 3.
Figure 3.
Cross-validated prediction of therapeutically relevant genetic groups using the GFEN on the discovery cohort. Deep learning models were trained to predict therapy relevant genetic subgroups from bone marrow smears. The patient-level performance of the GFEN is evaluated for every genetically defined group (ELN 2017 favorable risk, CBFB::MYH11 fusions, NPM1 mutations, FLT3-ITD and -tyrosine kinase domain (TKD) mutations, and MRC cytogenetics) with the AUROC and 2-sided P value for the prediction scores. Values for the validation runs of the fivefold cross-validation are given for the individual folds (cv0, cv1, cv2, cv3, and cv4) and all validation folds of the complete cohort combined (compiled). Error bars show 95% CI. The dashed line represents a P value <.05.
Figure 4.
Figure 4.
Highest scoring single-cell images from occlusion sensitivity analysis for the different genetic categories. The images represent the most important images for the decision toward a label.
Figure 5.
Figure 5.
Single-cell saliency maps for different genetic categories. The importance of individual pixel for the classification decision are calculated based on gradient computations in the GFEN model and are overlaid onto the original image (high gradient: yellow to green).

Comment in

References

    1. Döhner H, Estey E, Grimwade D, et al. Diagnosis and management of AML in adults: 2017 ELN recommendations from an international expert panel. Blood. 2017;129(4):424–447. - PMC - PubMed
    1. Tallman MS, Wang ES, Altman JK, et al. Acute myeloid leukemia, version 3.2019, NCCN Clinical Practice Guidelines in oncology. J Natl Compr Canc Netw. 2019;17(6):721–749. - PubMed
    1. Heuser M, Ofran Y, Boissel N, et al. Acute myeloid leukaemia in adult patients: ESMO Clinical Practice Guidelines for diagnosis, treatment and follow-up. Ann Oncol. 2020;31(6):697–712. - PubMed
    1. Arber DA, Orazi A, Hasserjian R, et al. The 2016 revision to the World Health Organization classification of myeloid neoplasms and acute leukemia. Blood. 2016;127(20):2391–2405. - PubMed
    1. Castaigne S, Pautas C, Terré C, et al. Effect of gemtuzumab ozogamicin on survival of adult patients with de-novo acute myeloid leukaemia (ALFA-0701): a randomised, open-label, phase 3 study. Lancet. 2012;379(9825):1508–1516. - PubMed

Publication types

MeSH terms