Improved personalized survival prediction of patients with diffuse large B-cell Lymphoma using gene expression profiling
- PMID: 33087075
- PMCID: PMC7579992
- DOI: 10.1186/s12885-020-07492-y
Improved personalized survival prediction of patients with diffuse large B-cell Lymphoma using gene expression profiling
Abstract
Background: Thirty to forty percent of patients with Diffuse Large B-cell Lymphoma (DLBCL) have an adverse clinical evolution. The increased understanding of DLBCL biology has shed light on the clinical evolution of this pathology, leading to the discovery of prognostic factors based on gene expression data, genomic rearrangements and mutational subgroups. Nevertheless, additional efforts are needed in order to enable survival predictions at the patient level. In this study we investigated new machine learning-based models of survival using transcriptomic and clinical data.
Methods: Gene expression profiling (GEP) of in 2 different publicly available retrospective DLBCL cohorts were analyzed. Cox regression and unsupervised clustering were performed in order to identify probes associated with overall survival on the largest cohort. Random forests were created to model survival using combinations of GEP data, COO classification and clinical information. Cross-validation was used to compare model results in the training set, and Harrel's concordance index (c-index) was used to assess model's predictability. Results were validated in an independent test set.
Results: Two hundred thirty-three and sixty-four patients were included in the training and test set, respectively. Initially we derived and validated a 4-gene expression clusterization that was independently associated with lower survival in 20% of patients. This pattern included the following genes: TNFRSF9, BIRC3, BCL2L1 and G3BP2. Thereafter, we applied machine-learning models to predict survival. A set of 102 genes was highly predictive of disease outcome, outperforming available clinical information and COO classification. The final best model integrated clinical information, COO classification, 4-gene-based clusterization and the expression levels of 50 individual genes (training set c-index, 0.8404, test set c-index, 0.7942).
Conclusion: Our results indicate that DLBCL survival models based on the application of machine learning algorithms to gene expression and clinical data can largely outperform other important prognostic variables such as disease stage and COO. Head-to-head comparisons with other risk stratification models are needed to compare its usefulness.
Keywords: DLBCL; Lymphoma; Prediction; Survival; Transcriptomics.
Conflict of interest statement
The authors declare no competing interests.
Figures



Similar articles
-
Prediction of survival in diffuse large B-cell lymphoma based on the expression of 2 genes reflecting tumor and microenvironment.Blood. 2011 Aug 4;118(5):1350-8. doi: 10.1182/blood-2011-03-345272. Epub 2011 Jun 13. Blood. 2011. PMID: 21670469 Free PMC article.
-
Dissection of DLBCL microenvironment provides a gene expression-based predictor of survival applicable to formalin-fixed paraffin-embedded tissue.Ann Oncol. 2018 Dec 1;29(12):2363-2370. doi: 10.1093/annonc/mdy450. Ann Oncol. 2018. PMID: 30307529 Free PMC article.
-
Discovery and validation of immune-associated long non-coding RNA biomarkers associated with clinically molecular subtype and prognosis in diffuse large B cell lymphoma.Mol Cancer. 2017 Jan 19;16(1):16. doi: 10.1186/s12943-017-0580-4. Mol Cancer. 2017. PMID: 28103885 Free PMC article.
-
Genomic profiling using array comparative genomic hybridization define distinct subtypes of diffuse large B-cell lymphoma: a review of the literature.J Hematol Oncol. 2012 Sep 11;5:54. doi: 10.1186/1756-8722-5-54. J Hematol Oncol. 2012. PMID: 22967872 Free PMC article. Review.
-
Diffuse large B cell lymphoma: from gene expression profiling to prediction of outcome.Biol Blood Marrow Transplant. 2008 Jan;14(1 Suppl 1):108-11. doi: 10.1016/j.bbmt.2007.10.020. Biol Blood Marrow Transplant. 2008. PMID: 18162230 Free PMC article. Review.
Cited by
-
Genetic and transcriptomic analyses of diffuse large B-cell lymphoma patients with poor outcomes within two years of diagnosis.Leukemia. 2024 Mar;38(3):610-620. doi: 10.1038/s41375-023-02120-7. Epub 2023 Dec 29. Leukemia. 2024. PMID: 38158444 Free PMC article.
-
Prognostic Stratification of Diffuse Large B-cell Lymphoma Using Clinico-genomic Models: Validation and Improvement of the LymForest-25 Model.Hemasphere. 2022 Mar 25;6(4):e706. doi: 10.1097/HS9.0000000000000706. eCollection 2022 Apr. Hemasphere. 2022. PMID: 35392483 Free PMC article.
-
Avoiding C-hacking when evaluating survival distribution predictions with discrimination measures.Bioinformatics. 2022 Sep 2;38(17):4178-4184. doi: 10.1093/bioinformatics/btac451. Bioinformatics. 2022. PMID: 35818973 Free PMC article.
-
Incorporation of emergent symptoms and genetic covariates improves prediction of aromatase inhibitor therapy discontinuation.JAMIA Open. 2024 Jan 19;7(1):ooae006. doi: 10.1093/jamiaopen/ooae006. eCollection 2024 Apr. JAMIA Open. 2024. PMID: 38250582 Free PMC article.
-
A prognostic model based on gene expression parameters predicts a better response to bortezomib-containing immunochemotherapy in diffuse large B-cell lymphoma.Front Oncol. 2023 Apr 28;13:1157646. doi: 10.3389/fonc.2023.1157646. eCollection 2023. Front Oncol. 2023. PMID: 37188190 Free PMC article.
References
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Research Materials