Development and evaluation of a computable phenotype to identify pediatric patients with leukemia and lymphoma treated with chemotherapy using electronic health record data
- PMID: 31207054
- PMCID: PMC7135896
- DOI: 10.1002/pbc.27876
Abstract
Background: Widespread implementation of electronic health records (EHR) has created new opportunities for pediatric oncology observational research. Little attention has been given to using EHR data to identify patients with pediatric hematologic malignancies.
Methods: This study used EHR-derived data in a pediatric clinical data research network, PEDSnet, to develop and evaluate a computable phenotype algorithm to identify pediatric patients with leukemia and lymphoma who received treatment with chemotherapy. To guide early development, multiple computable phenotype-defined cohorts were compared to one institution's tumor registry. The most promising algorithm was chosen for formal evaluation and consisted of at least two leukemia/lymphoma diagnoses (Systematized Nomenclature of Medicine codes) within a 90-day period, two chemotherapy exposures, and three hematology-oncology provider encounters. During evaluation, the computable phenotype was executed against EHR data from 2011 to 2016 at three large institutions. Classification accuracy was assessed by masked medical record review with phenotype-identified patients compared to a control group with at least three hematology-oncology encounters.
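The three criteria of the selected algorithm can be illustrated with a minimal Python sketch. This is not the authors' PEDSnet implementation: the function names, the per-patient inputs (a list of diagnosis dates and two simple counts), and the inclusive reading of "within a 90-day period" are assumptions made for illustration only.

```python
from datetime import date, timedelta


def has_two_diagnoses_within(diagnosis_dates, window_days=90):
    """Return True if any two diagnosis dates lie within window_days of each other.

    Assumption: the window is inclusive and repeated dates count as separate
    diagnoses; the abstract does not specify either detail.
    """
    ordered = sorted(diagnosis_dates)
    # After sorting, the closest pair is always adjacent, so checking
    # neighbouring dates is sufficient.
    return any(
        (later - earlier) <= timedelta(days=window_days)
        for earlier, later in zip(ordered, ordered[1:])
    )


def meets_phenotype(diagnosis_dates, chemo_exposures, hemonc_encounters):
    """Apply the three criteria named in the abstract to one patient.

    diagnosis_dates   -- dates of leukemia/lymphoma diagnosis codes (hypothetical input)
    chemo_exposures   -- number of chemotherapy exposures (hypothetical input)
    hemonc_encounters -- number of hematology-oncology provider encounters
    """
    return (
        has_two_diagnoses_within(diagnosis_dates)
        and chemo_exposures >= 2
        and hemonc_encounters >= 3
    )
```

For example, a hypothetical patient with diagnoses on 2012-01-01 and 2012-03-01, two chemotherapy exposures, and three hematology-oncology encounters would satisfy all three criteria under these assumptions.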
Results: The computable phenotype had sensitivity of 100% (confidence interval [CI] 99%, 100%), specificity of 99% (CI 99%, 100%), positive predictive value (PPV) and negative predictive value (NPV) of 100%, and C-statistic of 1 at the development institution. The computable phenotype performance was similar at the two test institutions with sensitivity of 100% (CI 99%, 100%), specificity of 99% (CI 99%, 100%), PPV of 96%, NPV of 100%, and C-statistic of 0.99.
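The accuracy measures reported above follow from the counts of a two-by-two table comparing phenotype classification with medical record review. The sketch below shows the standard formulas; the function name is hypothetical and the counts in the usage note are illustrative only, not the study's data.

```python
def classification_metrics(tp, fp, fn, tn):
    """Compute standard accuracy measures from confusion-matrix counts.

    tp -- phenotype-positive patients confirmed by record review
    fp -- phenotype-positive patients not confirmed by record review
    fn -- phenotype-negative patients who were true cases
    tn -- phenotype-negative patients who were true non-cases
    """
    return {
        "sensitivity": tp / (tp + fn),  # share of true cases that were identified
        "specificity": tn / (tn + fp),  # share of non-cases correctly excluded
        "ppv": tp / (tp + fp),          # share of identified patients who are true cases
        "npv": tn / (tn + fn),          # share of excluded patients who are true non-cases
    }
```

With made-up counts such as classification_metrics(tp=45, fp=5, fn=0, tn=95), sensitivity and NPV are 1.0, specificity is 0.95, and PPV is 0.9. The function does not guard against empty rows or columns, so a zero denominator would raise an error.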
Conclusion: The EHR-based computable phenotype is an accurate cohort identification tool for pediatric patients with leukemia and lymphoma who have been treated with chemotherapy and is ready for use in clinical studies.
Keywords: computable phenotype; epidemiology; leukemias (acute); lymphoma; pediatric oncology.
© 2019 Wiley Periodicals, Inc.
Conflict of interest statement
The authors have no conflicts of interest.
Similar articles
- Using Electronic Health Record Data to Rapidly Identify Children with Glomerular Disease for Clinical Research. J Am Soc Nephrol. 2019 Dec;30(12):2427-2435. doi: 10.1681/ASN.2019040365. Epub 2019 Nov 15. PMID: 31732612. Free PMC article.
- Development and evaluation of an EHR-based computable phenotype for identification of pediatric Crohn's disease patients in a National Pediatric Learning Health System. Learn Health Syst. 2020 Aug 28;4(4):e10243. doi: 10.1002/lrh2.10243. eCollection 2020 Oct. PMID: 33083542. Free PMC article.
- A Computable Phenotype Improves Cohort Ascertainment in a Pediatric Pulmonary Hypertension Registry. J Pediatr. 2017 Sep;188:224-231.e5. doi: 10.1016/j.jpeds.2017.05.037. Epub 2017 Jun 16. PMID: 28625502. Free PMC article.
- Statistical Methods for Phenotype Estimation and Analysis Using Electronic Health Records [Internet]. Washington (DC): Patient-Centered Outcomes Research Institute (PCORI); 2021 Mar. PMID: 39133799. Free Books & Documents. Review.
- Trends and opportunities in computable clinical phenotyping: A scoping review. J Biomed Inform. 2023 Apr;140:104335. doi: 10.1016/j.jbi.2023.104335. Epub 2023 Mar 16. PMID: 36933631.
Cited by
- Automated Electronic Health Record Data Extraction and Curation Using ExtractEHR. JCO Clin Cancer Inform. 2024 Nov;8:e2400100. doi: 10.1200/CCI.24.00100. Epub 2024 Nov 25. PMID: 39586036. Free PMC article.
- Implementation science in pediatric oncology: A narrative review and future directions. Pediatr Blood Cancer. 2022 Apr;69(4):e29579. doi: 10.1002/pbc.29579. Epub 2022 Jan 19. PMID: 35044081. Free PMC article. Review.
- Clinical comparison between trial participants and potentially eligible patients using electronic health record data: A generalizability assessment method. J Biomed Inform. 2021 Jul;119:103822. doi: 10.1016/j.jbi.2021.103822. Epub 2021 May 25. PMID: 34044156. Free PMC article.
- Electronic health records identify timely trends in childhood mental health conditions. Child Adolesc Psychiatry Ment Health. 2023 Sep 14;17(1):107. doi: 10.1186/s13034-023-00650-7. PMID: 37710303. Free PMC article.
- Medication based machine learning to identify subpopulations of pediatric hemodialysis patients in an electronic health record database. Inform Med Unlocked. 2022;34:101104. doi: 10.1016/j.imu.2022.101104. Epub 2022 Oct 6. PMID: 36405250. Free PMC article.