Predicting therapeutic clinical trial enrollment for adult patients with low- and high-grade glioma using supervised machine learning

Mulki Mehari¹, Gayathri Warrier¹, Abraham Dada¹, Aymen Kabir¹, Aden P Haskell-Mendoza², Arushi Tripathy³, Rohan Jha⁴, Edwin Nieblas-Bedolla³, Joshua D Jackson², Ariel T Gonzalez², Ellery H Reason², Ann Marie Flusche², Sheantel Reihl⁵, Tara Dalton², Mikias Negussie¹, Cesar Nava Gonzales¹, Vardhaan S Ambati¹, Annick Desjardins⁶, Andy G S Daniel¹, Saritha Krishna¹, Susan Chang⁷, Alyx Porter⁸, Peter E Fecci⁶, Todd Hollon⁹, Ugonma N Chukwueke¹⁰, Kimberly Badal¹¹, Annette M Molinaro^{1

12}, Shawn L Hervey-Jumper^{1

12}

Affiliations

¹ Department of Neurosurgery, University of California, San Francisco, San Francisco, CA 94143, USA.
² Department of Neurosurgery, Duke University Medical Center, Durham, NC 27710, USA.
³ Department of Neurosurgery, Michigan Medicine, Ann Arbor, MI 48109, USA.
⁴ Harvard Medical School, Boston, MA 02115, USA.
⁵ Department of Neurosurgery, University of California, Los Angeles, Los Angeles, CA 90095, USA.
⁶ Preston Robert Tisch Brain Tumor Center, Department of Neurosurgery, Duke University Medical Center, Durham, NC 27710, USA.
⁷ Division of Neuro-Oncology and Weill Institute for Neurosciences, University of California, San Francisco, San Francisco, CA 94158, USA.
⁸ Division of Neuro-Oncology, Department of Neurology, Mayo Clinic, Phoenix, AZ 85054, USA.
⁹ Machine Learning in Neurosurgery Laboratory, Department of Neurosurgery, University of Michigan, Ann Arbor, MI 48109, USA.
¹⁰ Center for Neuro-Oncology, Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA 02215, USA.
¹¹ Department of Surgery, Helen Diller Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA 94158, USA.
¹² Weill Institute for Neurosciences, University of California, San Francisco, San Francisco, CA 94158, USA.

PMID: 40465719
PMCID: PMC12136030
DOI: 10.1126/sciadv.adt5708

Predicting therapeutic clinical trial enrollment for adult patients with low- and high-grade glioma using supervised machine learning

Mulki Mehari et al. Sci Adv. 2025.

. 2025 Jun 6;11(23):eadt5708.

doi: 10.1126/sciadv.adt5708. Epub 2025 Jun 4.

Authors

Affiliations

¹ Department of Neurosurgery, University of California, San Francisco, San Francisco, CA 94143, USA.
² Department of Neurosurgery, Duke University Medical Center, Durham, NC 27710, USA.
³ Department of Neurosurgery, Michigan Medicine, Ann Arbor, MI 48109, USA.
⁴ Harvard Medical School, Boston, MA 02115, USA.
⁵ Department of Neurosurgery, University of California, Los Angeles, Los Angeles, CA 90095, USA.
⁶ Preston Robert Tisch Brain Tumor Center, Department of Neurosurgery, Duke University Medical Center, Durham, NC 27710, USA.
⁷ Division of Neuro-Oncology and Weill Institute for Neurosciences, University of California, San Francisco, San Francisco, CA 94158, USA.
⁸ Division of Neuro-Oncology, Department of Neurology, Mayo Clinic, Phoenix, AZ 85054, USA.
⁹ Machine Learning in Neurosurgery Laboratory, Department of Neurosurgery, University of Michigan, Ann Arbor, MI 48109, USA.
¹⁰ Center for Neuro-Oncology, Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA 02215, USA.
¹¹ Department of Surgery, Helen Diller Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA 94158, USA.
¹² Weill Institute for Neurosciences, University of California, San Francisco, San Francisco, CA 94158, USA.

PMID: 40465719
PMCID: PMC12136030
DOI: 10.1126/sciadv.adt5708

Abstract

Therapeutic clinical trial enrollment does not match glioma incidence across demographics. Traditional statistical methods have identified independent predictors of trial enrollment; however, our understanding of the interactions between these factors remains limited. To test the interactive effects of demographic, socioeconomic, and oncologic variables on trial enrollment, we designed boosted neural networks (BNNs) for all glioma patients (n = 1042), women (n = 445, 42.7%), and minorities (n = 151, 14.5%) and externally validated these models [whole cohort, n = 230; women, n = 89 (38.7%); minority, n = 66 (28.7%)]. For the whole-cohort BNN, the most influential variables on enrollment were oncologic variables, including KPS [total effect (TE), 0.327], chemotherapy (TE, 0.326), tumor location (TE, 0.322), and seizures (TE, 0.239). The women-only BNN exhibited a similar trend. Conversely, for the minority-only BNN, socioeconomic variables [insurance status (TE, 0.213), occupation classification (TE, 0.204), and employment status (TE, 0.150)] were most influential. These results may help prioritize patient-specific initiatives to increase accrual.

PubMed Disclaimer

Figures

**Fig. 1.. ROC curves and ranked variables of importance for BNN models for trial enrollment.**
The figure depicts the receiver operator characteristic (ROC) curves and variables of importance for predicting therapeutic clinical trial enrollment using BNN models for the whole, women-only, and minority-only cohorts. In all ROC curves, the red lines represent clinical trial enrollment, while the blue lines represent lack of enrollment. The AUC values for each ROC curve are listed. The main and total effects for each variable are listed in variables of importance output for each model. Adjacent to the variables of importance outputs are color-coded heatmaps indicating whether the ranked factors are demographic, socioeconomic, or oncologic variables. (A) The ROC curve for the whole development cohort. (B) The ROC curve for the whole validation cohort. (C) The variables of importance for the whole-cohort model in descending order of importance. (D) The ROC curve for the women development cohort. (E) The ROC curve for the women validation cohort. (F) The ranked variables of importance for the women cohort model in descending order of importance. (G) The ROC curve for the minority development cohort. (H) The ROC curve for the minority validation cohort. (I) The ranked variables of importance for the minority cohort model in descending order of importance.

**Fig. 2.. Shapley values for influential predictors for the whole-cohort BNN model.**
The figure shows the distribution of Shapley values for each influential predictor in the whole-cohort BNN model. Each dot corresponds to the Shapley value for each predictor for a single patient. Positive and negative Shapley values indicate increased and decreased probability of enrollment, respectively. Each predictor is stratified into two levels, with the characteristic associated with higher mean Shapley values color coded in dark blue and the characteristic with the lower mean Shapley values color coded in dark red. t tests were used to assess for differences in mean Shapley values for each predictor. Predictors are arranged from most influential to least influential (left to right) according to the variables of importance calculation for the predictive model. **P < 0.01; ***P < 0.0001.

**Fig. 3.. GPREDICT schematic for BNN development and validation for the whole cohort, women-only cohort, and minority-only cohort.**
The figure represents four academic medical centers, and the schematic depicts the sequence of dividing the whole cohort into the women-only and minority-only cohorts for model development, model screening with internal cross-validation, and external validation.

See this image and copyright information in PMC

References

1. Chang S. M., Barker F. G. II, Schmidt M. H., Sloan A. E., Kasper R., Phillips L., Shih K., Hariharan S., Berger M. S., The Glioma Outcomes Investigators , Clinical trial participation among patients enrolled in the Glioma Outcomes Project. Cancer 94, 2681–2687 (2002). - PubMed
1. Escritt K., Mann M., Nelson A., Harrop E., Hope and meaning-making in phase 1 oncology trials: A systematic review and thematic synthesis of qualitative evidence on patient-participant experiences. Trials 23, 409 (2022). - PMC - PubMed
1. Reihl S. J., Patil N., Morshed R. A., Mehari M., Aabedi A., Chukwueke U. N., Porter A. B., Fontil V., Cioffi G., Waite K., Kruchko C., Ostrom Q., Barnholtz-Sloan J., Hervey-Jumper S. L., A population study of clinical trial accrual for women and minorities in neuro-oncology following the NIH Revitalization Act. Neuro Oncol. 24, 1341–1349 (2022). - PMC - PubMed
1. Murthy V. H., Krumholz H. M., Gross C. P., Participation in cancer clinical trials: Race-, sex-, and age-based disparities. JAMA 291, 2720–2726 (2004). - PubMed
1. US Congress, National Institutes of Health Revitalization Act of 1993, Public Law 103-43 (Washington, DC, 10 June 1993).

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
Medical
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Predicting therapeutic clinical trial enrollment for adult patients with low- and high-grade glioma using supervised machine learning

Affiliations

Predicting therapeutic clinical trial enrollment for adult patients with low- and high-grade glioma using supervised machine learning

Authors

Affiliations

Abstract

Figures

References

MeSH terms

LinkOut - more resources

Full Text Sources

Medical