A review of approaches to identifying patient phenotype cohorts using electronic health records

Chaitanya Shivade¹, Preethi Raghavan, Eric Fosler-Lussier, Peter J Embi, Noemie Elhadad, Stephen B Johnson, Albert M Lai

Affiliations

PMID: 24201027
PMCID: PMC3932460
DOI: 10.1136/amiajnl-2013-001935

Review

A review of approaches to identifying patient phenotype cohorts using electronic health records

Chaitanya Shivade et al. J Am Med Inform Assoc. 2014 Mar-Apr.

. 2014 Mar-Apr;21(2):221-30.

doi: 10.1136/amiajnl-2013-001935. Epub 2013 Nov 7.

Authors

Chaitanya Shivade¹, Preethi Raghavan, Eric Fosler-Lussier, Peter J Embi, Noemie Elhadad, Stephen B Johnson, Albert M Lai

Affiliation

¹ Department of Computer Science and Engineering, The Ohio State University, Columbus, Ohio, USA.

PMID: 24201027
PMCID: PMC3932460
DOI: 10.1136/amiajnl-2013-001935

Abstract

Objective: To summarize literature describing approaches aimed at automatically identifying patients with a common phenotype.

Materials and methods: We performed a review of studies describing systems or reporting techniques developed for identifying cohorts of patients with specific phenotypes. Every full text article published in (1) Journal of American Medical Informatics Association, (2) Journal of Biomedical Informatics, (3) Proceedings of the Annual American Medical Informatics Association Symposium, and (4) Proceedings of Clinical Research Informatics Conference within the past 3 years was assessed for inclusion in the review. Only articles using automated techniques were included.

Results: Ninety-seven articles met our inclusion criteria. Forty-six used natural language processing (NLP)-based techniques, 24 described rule-based systems, 41 used statistical analyses, data mining, or machine learning techniques, while 22 described hybrid systems. Nine articles described the architecture of large-scale systems developed for determining cohort eligibility of patients.

Discussion: We observe that there is a rise in the number of studies associated with cohort identification using electronic medical records. Statistical analyses or machine learning, followed by NLP techniques, are gaining popularity over the years in comparison with rule-based systems.

Conclusions: There are a variety of approaches for classifying patients into a particular phenotype. Different techniques and data sources are used, and good performance is reported on datasets at respective institutions. However, no system makes comprehensive use of electronic medical records addressing all of their known weaknesses.

Keywords: Cohort Identification; Electronic Health Records; Phenotyping; Review.

PubMed Disclaimer

Figures

**Figure 1**
Flow chart of study inclusion.

See this image and copyright information in PMC

References

1. Mathias JS, Gossett D, Baker DW. Use of electronic health record data to evaluate overuse of cervical cancer screening. J Am Med Inform Assoc 2012;19:e96–101 - PMC - PubMed
1. Strom BL, Schinnar R, Jones J, et al. Detecting pregnancy use of non-hormonal category X medications in electronic medical records. J Am Med Inform Assoc 2011;18(Suppl 1):i81–6 - PMC - PubMed
1. Garvin JH, DuVall SL, South BR, et al. Automated extraction of ejection fraction for quality measurement using regular expressions in Unstructured Information Management Architecture (UIMA) for heart failure. J Am Med Inform Assoc 2012;19:859–66 - PMC - PubMed
1. Sun J, Hu J, Luo D, et al. Combining knowledge and data driven insights for identifying risk factors using electronic health records. AMIA Annu Symp Proc 2012;2012:901–10 - PMC - PubMed
1. Son C-S, Kim Y-N, Kim H-S, et al. Decision-making model for early diagnosis of congestive heart failure using rough set and decision tree approaches. J Biomed Inform 2012;45:999–1008 - PubMed

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database
- scite Smart Citations
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

A review of approaches to identifying patient phenotype cohorts using electronic health records

Affiliation

A review of approaches to identifying patient phenotype cohorts using electronic health records

Authors

Affiliation

Abstract

Figures

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources

Miscellaneous