Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jul-Sep;29(3):14604582231198021.
doi: 10.1177/14604582231198021.

Machine learning-based natural language processing to extract PD-L1 expression levels from clinical notes

Affiliations
Free article

Machine learning-based natural language processing to extract PD-L1 expression levels from clinical notes

Eric Lin et al. Health Informatics J. 2023 Jul-Sep.
Free article

Abstract

Introduction: PD-L1 expression is used to determine oncology patients' response to and eligibility for immunologic treatments; however, PD-L1 expression status often only exists in unstructured clinical notes, limiting ability to use it in population-level studies. Methods: We developed and evaluated a machine learning based natural language processing (NLP) tool to extract PD-L1 expression values from the nationwide Veterans Affairs electronic health record system. Results: The model demonstrated strong evaluation performance across multiple levels of label granularity. Mean precision of the overall PD-L1 positive label was 0.859 (sd, 0.039), recall 0.994 (sd, 0.013), and F1 0.921 (0.024). When a numeric PD-L1 value was identified, the mean absolute error of the value was 0.537 on a scale of 0 to 100. Conclusion: We presented an accurate NLP method for deriving PD-L1 status from clinical notes. By reducing the time and manual effort needed to review medical records, our work will enable future population-level studies in cancer immunotherapy.

Keywords: PD-l1; cancer; electronic health records; machine learning; natural language processing.

PubMed Disclaimer

Publication types

LinkOut - more resources