J Biomed Inform. 2023 Jun:142:104343.
doi: 10.1016/j.jbi.2023.104343. Epub 2023 Mar 17.

Representing and utilizing clinical textual data for real world studies: An OHDSI approach

Vipina K Keloth et al. J Biomed Inform. 2023 Jun.

Abstract

Clinical documentation in electronic health records contains crucial narratives and details about patients and their care. Natural language processing (NLP) can unlock the information conveyed in clinical notes and reports, and thus plays a critical role in real-world studies. The NLP Working Group at the Observational Health Data Sciences and Informatics (OHDSI) consortium was established to develop methods and tools to promote the use of textual data and NLP in real-world observational studies. In this paper, we describe a framework for representing and utilizing textual data in real-world evidence generation, including representations of information from clinical text in the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM), the workflow and tools that were developed to extract, transform and load (ETL) data from clinical notes into tables in OMOP CDM, as well as current applications and specific use cases of the proposed OHDSI NLP solution at large consortia and individual institutions with English textual data. Challenges faced and lessons learned during the process are also discussed to provide valuable insights for researchers who are planning to implement NLP solutions in real-world studies.

Keywords: Electronic health records; Natural language processing; Real-world study.

Conflict of interest statement

Declaration of Competing Interest: Dr. Hua Xu and The University of Texas Health Science Center at Houston have research-related financial interests at Melax Technologies Inc. Dr. Xiaoyan Wang has related financial interests at Sema4 Mount Sinai Genomics Inc.

Figures

Figure 1: Schema of NOTE (left) and NOTE_NLP (right) tables in OMOP CDM 5.4 [30]
Figure 2: An overview of the workflow for transforming clinical text in the NOTE table
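
To make the NOTE to NOTE_NLP transformation concrete, the following is a minimal sketch and not the OHDSI NLP Working Group's actual tooling. It assumes a toy dictionary matcher in place of a real clinical NLP system, placeholder concept ids, and a simple "Y"/"N" convention for term_exists. The row class uses a subset of the NOTE_NLP column names from OMOP CDM 5.4; the function name, lexicon and negation rule are hypothetical.

```python
import re
from dataclasses import dataclass
from datetime import date


@dataclass
class NoteNlpRow:
    """Subset of NOTE_NLP columns (OMOP CDM 5.4); other columns omitted."""
    note_nlp_id: int
    note_id: int
    snippet: str
    offset: str              # stored as text, here the character start index
    lexical_variant: str
    note_nlp_concept_id: int
    nlp_system: str
    nlp_date: date
    term_exists: str         # assumed convention: "Y" = asserted, "N" = negated


# Hypothetical lexicon: the concept ids are placeholders, not real OMOP concepts.
TOY_LEXICON = {"hypertension": 1001, "diabetes": 1002}

# Crude negation cue: a trigger word with no sentence break before the term.
NEGATION = re.compile(r"\b(no|denies|without)\b[^.]*$", re.IGNORECASE)


def extract_note_nlp(note_id, note_text, nlp_date, start_id=1, window=30):
    """Turn one NOTE.note_text value into a list of NOTE_NLP-style rows."""
    rows = []
    next_id = start_id
    lowered = note_text.lower()
    for term, concept_id in TOY_LEXICON.items():
        for m in re.finditer(r"\b" + re.escape(term) + r"\b", lowered):
            left = max(0, m.start() - window)
            right = min(len(note_text), m.end() + window)
            negated = NEGATION.search(note_text[left:m.start()]) is not None
            rows.append(NoteNlpRow(
                note_nlp_id=next_id,
                note_id=note_id,
                snippet=note_text[left:right],
                offset=str(m.start()),
                lexical_variant=note_text[m.start():m.end()],
                note_nlp_concept_id=concept_id,
                nlp_system="toy-dictionary-matcher 0.1",
                nlp_date=nlp_date,
                term_exists="N" if negated else "Y",
            ))
            next_id += 1
    return rows


if __name__ == "__main__":
    text = "Patient denies diabetes. History of hypertension."
    for row in extract_note_nlp(1, text, date(2023, 1, 1)):
        print(row.lexical_variant, row.offset, row.term_exists)
```

In a real pipeline the matcher would be replaced by a clinical NLP system and the resulting rows would be loaded into the NOTE_NLP table; the sketch only shows how extracted mentions, their positions and their assertion status could map onto the columns.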

References

    1. Corrigan-Curay J, Sacks L, Woodcock J. Real-world evidence and real-world data for evaluating drug safety and effectiveness. JAMA. 2018;320(9):867–8.
    2. Baumfeld Andre E, Reynolds R, Caubel P, Azoulay L, Dreyer NA. Trial designs using real-world data: the changing landscape of the regulatory approval process. Pharmacoepidemiology and Drug Safety. 2020;29(10):1201–12.
    3. Skovlund E, Leufkens H, Smyth J. The use of real-world data in cancer drug development. European Journal of Cancer. 2018;101:69–76.
    4. Trojano M, Tintore M, Montalban X, Hillert J, Kalincik T, Iaffaldano P, et al. Treatment decisions in multiple sclerosis—insights from real-world observational studies. Nature Reviews Neurology. 2017;13(2):105–18.
    5. U.S. Food and Drug Administration - Real-World Evidence [cited 2022 Jan 30]. Available from: https://www.fda.gov/science-research/science-and-research-special-topics....