Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jun 28;6(12):3716-3728.
doi: 10.1182/bloodadvances.2021006351.

Prediction of clinical outcome in CLL based on recurrent gene mutations, CLL-IPI variables, and (para)clinical data

Affiliations

Prediction of clinical outcome in CLL based on recurrent gene mutations, CLL-IPI variables, and (para)clinical data

Mehdi Parviz et al. Blood Adv. .

Abstract

A highly variable clinical course, immune dysfunction, and a complex genetic blueprint pose challenges for treatment decisions and the management of risk of infection in patients with chronic lymphocytic leukemia (CLL). In recent years, the use of machine learning (ML) technologies has made it possible to attempt to untangle such heterogeneous disease entities. In this study, using 3 classes of variables (international prognostic index for CLL [CLL-IPI] variables, baseline [para]clinical data, and data on recurrent gene mutations), we built ML predictive models to identify the individual risk of 4 clinical outcomes: death, treatment, infection, and the combined outcome of treatment or infection. Using the predictive models, we assessed to what extent the different classes of variables are predictive of the 4 different outcomes, within both a short-term 2-year outlook and a long-term 5-year outlook after CLL diagnosis. By adding the baseline (para)clinical data to CLL-IPI variables, predictive performance was improved, whereas no further improvement was observed when including the data on recurrent genetic mutations. We discovered 2 main clusters of variables predictive of treatment and infection. Further emphasizing the high mortality resulting from infection in CLL, we found a close similarity between variables predictive of infection in the short-term outlook and those predictive of death in the long-term outlook. We conclude that at the time of CLL diagnosis, routine (para)clinical data are more predictive of patient outcome than recurrent mutations. Future studies on modeling genetics and clinical outcome should always consider the inclusion of several (para)clinical data to improve performance.

PubMed Disclaimer

Figures

None
Graphical abstract
Figure 1.
Figure 1.
Schematic representation of the data collection and analysis. (A) Data from different data sources, including baseline tests (formula image), routine laboratory tests (formula image), and recurrent mutations (formula image), were combined to construct a heterogeneous data set. Prediction point was set at 3 months postdiagnosis, and clinical outcomes (formula image) were predicted. (B) The clinical outcomes were death (formula image), treatment (formula image), the combined event of treatment or infection (composite), and infection (formula image). (C) Based on the combination of feature sets, 4 models were defined: (1) IPI, which included CLL-IPI score and the CLL-IPI features only; (2) +BL, which included CLL-IPI features, baseline tests, and routine laboratory tests; (3) +MUT, which included CLL-IPI features and recurrent mutations; and (4) ALL, which included all features. (D) Clinical outcomes were predicted in 2- and 5-year outlooks postdiagnosis (except for the first 3 months). (E) The data from different sources were merged to create one data set (formula image). Then, for a specific outcome and outlook, the target values were created and later used in the training/test (formula image). Based on the model, feature extraction was performed (formula image). A stacked ML model consisting of 7 algorithms and a fusion stage based on majority voting was trained and tested. The performance of the models (formula image) and the contribution of the features (formula image) were estimated to identify the risk factors predictive of each combination of outcome, model, and outlook. tNGS, targeted next-generation sequencing.
Figure 2.
Figure 2.
Comparison of the performance of the models in predicting clinical outcomes. Box plots depicting the performance of the 4 models in predicting death, treatment, infection, and the combined outcome of infection or treatment within 2-year (A,C,E,G) and 5-year (B,D,F,H) outlooks postdiagnosis. In each subplot, the box plots show the quartiles, the median, and the outliers of MCC values obtained using fivefold cross-validation on set A. The scatter plots (single marker for each model) demonstrate the MCC value of the holdout validation on set C. Death (A-B), treatment (C-D), treatment or infection (composite) (E-F), and infection (G-H). *P =.05, ***P =.001.
Figure 3.
Figure 3.
Identified risk factors predictive of the outcomes. SHAP plots on the full cohort illustrate the contribution of the most important features in predicting the clinical (mean absolute SHAP values >0.01) in 2-year outlook (A-D) and 5-year outlook (E-H). Death (A,E), treatment (B,F), treatment or infection (C,G), and infection (D,H). The cluster of features predictive of different outcomes was detected after sorting the features so that the most important features predictive of treatment appear at the top (red) and the most important features predictive of infection appear at the bottom (blue). The features more predictive of death or the composite outcome and not treatment or infection appear in the middle. ECOG, Eastern Cooperative Oncology Group; Ig, immunoglobulin.
Figure 4.
Figure 4.
Hierarchic clustering of risk factors predictive of clinical outcomes in the 2 outlooks. The similarity between each pair of risk factor patterns was calculated using cosine similarity, which is the cosine of the angle between 2 n-dimensional vectors. Then, the risk factor patterns were grouped by performing hierarchic clustering on the computed similarity matrix.
Figure 5.
Figure 5.
Event-free survival plots for 2-year outlook on the full cohort. Patients were stratified by the classifier using CLL-IPI variables (TP53 aberrations, IGHV mutational status, B2M level, clinical stage, and age [IPI]) (A,C,E,G) and addition of (para)clinical variables to the CLL-IPI variables (+BL) (B,D,F,H). Overall survival (A-B), treatment-free survival (C-D), treatment and infection–free survival (E-F), and infection-free survival (G-H). HR, hazard ratio.
Figure 6.
Figure 6.
Event-free survival plots for 5-year outlook on the full cohort. Patients were stratified by the classifier using CLL-IPI variables (TP53 aberrations, IGHV mutational status, B2M level, clinical stage, and age [IPI]) (A,C,E,G) and addition of (para)clinical variables to the CLL-IPI variables (+BL) (B,D,F,H). Overall survival (A-B), treatment-free survival (C-D), treatment and infection–free survival (E-F), and infection-free survival (G-H). HR, hazard ratio.

References

    1. Hallek M, Fischer K, Fingerle-Rowson G, et al. ; German Chronic Lymphocytic Leukaemia Study Group . Addition of rituximab to fludarabine and cyclophosphamide in patients with chronic lymphocytic leukaemia: a randomised, open-label, phase 3 trial. Lancet. 2010;376(9747):1164-1174. - PubMed
    1. Goede V, Fischer K, Busch R, et al. . Obinutuzumab plus chlorambucil in patients with CLL and coexisting conditions. N Engl J Med. 2014;370(12):1101-1110. - PubMed
    1. da Cunha-Bang C, Simonsen J, Rostgaard K, Geisler C, Hjalgrim H, Niemann CU. Improved survival for patients diagnosed with chronic lymphocytic leukemia in the era of chemo-immunotherapy: a Danish population-based study of 10455 patients. Blood Cancer J. 2016;6(11):e499. - PMC - PubMed
    1. Munir T, Brown JR, O’Brien S, et al. . Final analysis from RESONATE: up to six years of follow-up on ibrutinib in patients with previously treated chronic lymphocytic leukemia or small lymphocytic lymphoma. Am J Hematol. 2019;94(12):1353-1363. - PMC - PubMed
    1. Fischer K, Al-Sawaf O, Bahlo J, et al. . Venetoclax and obinutuzumab in patients with CLL and coexisting conditions. N Engl J Med. 2019;380(23):2225-2236. - PubMed

Publication types

MeSH terms