An information extraction framework for cohort identification using electronic health records

Hongfang Liu¹, Suzette J Bielinski, Sunghwan Sohn, Sean Murphy, Kavishwar B Wagholikar, Siddhartha R Jonnalagadda, K E Ravikumar, Stephen T Wu, Iftikhar J Kullo, Christopher G Chute

Affiliations

PMID: 24303255
PMCID: PMC3845757

An information extraction framework for cohort identification using electronic health records

Hongfang Liu et al. AMIA Jt Summits Transl Sci Proc. 2013.

. 2013 Mar 18:2013:149-53.

eCollection 2013.

Authors

Hongfang Liu¹, Suzette J Bielinski, Sunghwan Sohn, Sean Murphy, Kavishwar B Wagholikar, Siddhartha R Jonnalagadda, K E Ravikumar, Stephen T Wu, Iftikhar J Kullo, Christopher G Chute

Affiliation

¹ Department of Health Sciences Research, Rochester, MN.

PMID: 24303255
PMCID: PMC3845757

Abstract

Information extraction (IE), a natural language processing (NLP) task that automatically extracts structured or semi-structured information from free text, has become popular in the clinical domain for supporting automated systems at point-of-care and enabling secondary use of electronic health records (EHRs) for clinical and translational research. However, a high performance IE system can be very challenging to construct due to the complexity and dynamic nature of human language. In this paper, we report an IE framework for cohort identification using EHRs that is a knowledge-driven framework developed under the Unstructured Information Management Architecture (UIMA). A system to extract specific information can be developed by subject matter experts through expert knowledge engineering of the externalized knowledge resources used in the framework.

PubMed Disclaimer

Figures

**Figure 1.**
System architecture of the IE framework under cTAKES.

See this image and copyright information in PMC

References

1. Chapman WW , Bridewell W , Hanbury P , Cooper GF , Buchanan BG . Evaluation of negation phrases in narrative clinical reports . Proc AMIA Symp . 2001 : 105 – 109 . - PMC - PubMed
1. Wagholikar KB , Maclaughlin KL , Henry MR , et al. Clinical decision support with automated text processing for cervical cancer screening . Journal of the American Medical Informatics Association: JAMIA . 2012 Sep ; 19 ( 5 ): 833 – 839 . - PMC - PubMed
1. Chapman WW , Gundlapalli AV , South BR , Dowling JN . Natural language processing for biosurveillance . Infectious Disease Informatics and Biosurveillance . 2011 : 279 – 310 .
1. Chute CG , Beck SA , Fisk TB , Mohr DN . The Enterprise Data Trust at Mayo Clinic: a semantically integrated warehouse of biomedical data . Journal of American Medical Informatics Association: JAMIA . 2010 Mar-Apr; 17 ( 2 ): 131 – 135 . - PMC - PubMed
1. Savova GK , Masanz JJ , Ogren PV , et al. Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications . Journal of the American Medical Informatics Association : JAMIA . 2010 Sep-Oct; 17 ( 5 ): 507 – 513 . - PMC - PubMed

Grants and funding

LinkOut - more resources

Full Text Sources
- Europe PubMed Central
- PubMed Central

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

An information extraction framework for cohort identification using electronic health records

Affiliation

An information extraction framework for cohort identification using electronic health records

Authors

Affiliation

Abstract

Figures

References

Grants and funding

LinkOut - more resources

Full Text Sources