Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jun 3;26(6):1163-1170.
doi: 10.1093/neuonc/noad249.

Developing a computable phenotype for glioblastoma

Affiliations

Developing a computable phenotype for glioblastoma

Sandra Yan et al. Neuro Oncol. .

Abstract

Background: Glioblastoma is the most common malignant brain tumor, and thus it is important to be able to identify patients with this diagnosis for population studies. However, this can be challenging as diagnostic codes are nonspecific. The aim of this study was to create a computable phenotype (CP) for glioblastoma multiforme (GBM) from structured and unstructured data to identify patients with this condition in a large electronic health record (EHR).

Methods: We used the University of Florida (UF) Health Integrated Data Repository, a centralized clinical data warehouse that stores clinical and research data from various sources within the UF Health system, including the EHR system. We performed multiple iterations to refine the GBM-relevant diagnosis codes, procedure codes, medication codes, and keywords through manual chart review of patient data. We then evaluated the performances of various possible proposed CPs constructed from the relevant codes and keywords.

Results: We underwent six rounds of manual chart reviews to refine the CP elements. The final CP algorithm for identifying GBM patients was selected based on the best F1-score. Overall, the CP rule "if the patient had at least 1 relevant diagnosis code and at least 1 relevant keyword" demonstrated the highest F1-score using both structured and unstructured data. Thus, it was selected as the best-performing CP rule.

Conclusions: We developed and validated a CP algorithm for identifying patients with GBM using both structured and unstructured EHR data from a large tertiary care center. The final algorithm achieved an F1-score of 0.817, indicating a high performance, which minimizes possible biases from misclassification errors.

Keywords: Electronic Health Records (EHRs); computable phenotype; glioblastoma; structured data; unstructured data.

PubMed Disclaimer

Conflict of interest statement

D.A.M. holds patented technologies that have been licensed or have exclusive options to license to Celldex Therapeutics, Annias, Immunomic Therapeutics, and iOncologi. D.A.M. received research funding from Immunomic Therapeutics. D.A.M. serves/served as an advisor/consultant to Bristol-Myers Squibb, Tocagen, Oncorus, and RM Global. D.A.M. is co-founder of iOncologi, Inc., an immuno-oncology biotechnology company. A.P.G. serves/served as an advisor/consultant to Neosoma and Monteris Medical. A.P.G. has received honoraria for advisory board participation from Alexion Pharmaceuticals, Servier, and Aptitude Health. A.P.G. has held stock in Viatris Inc. All other authors have no conflicts to report.

Figures

Figure 1.
Figure 1.
Workflow for the development of the computable phenotype algorithm.

References

    1. Luo C, Song K, Wu S, et al.. The prognosis of glioblastoma: a large, multifactorial study. Br J Neurosurg. 2021;35(5):555–561. - PubMed
    1. Chen B, Chen C, Zhang Y, Xu J.. Recent incidence trend of elderly patients with glioblastoma in the United States, 2000-2017. BMC Cancer. 2021;21(1):54. - PMC - PubMed
    1. Oyer RA, Hurley P, Boehmer L, et al.. Increasing racial and ethnic diversity in cancer clinical trials: an American Society of Clinical Oncology and Association of Community Cancer Centers Joint Research statement. J Clin Oncol. 2022;40(19):2163–2171. - PubMed
    1. Taha B, Winston G, Tosi U, et al.. Missing diversity in brain tumor trials. Neuro-Oncol Adv. 2020;2(1):vdaa059. - PMC - PubMed
    1. Real-World Evidence. Where are we now. NEJM. https://www.nejm.org/doi/full/10.1056/NEJMp2200089. Accessed August 14, 2023. - DOI

Publication types

LinkOut - more resources