. 2019 Jul 18;19(1):155.

doi: 10.1186/s12874-019-0792-y.

Current approaches to identify sections within clinical narratives from electronic health records: a systematic review

Alexandra Pomares-Quimbaya¹, Markus Kreuzthaler², Stefan Schulz²

Affiliations

¹ Pontificia Universidad Javeriana, Cra. 7 No 40-62, Bogotá, 110231, Colombia. pomares@javeriana.edu.co.
² Institute for Medical Informatics, Statistics and Documentation, Medical University of Graz, Auenbruggerplatz 2, Graz, 8036, Austria.

PMID: 31319802
PMCID: PMC6637496
DOI: 10.1186/s12874-019-0792-y

Current approaches to identify sections within clinical narratives from electronic health records: a systematic review

Alexandra Pomares-Quimbaya et al. BMC Med Res Methodol. 2019.

. 2019 Jul 18;19(1):155.

doi: 10.1186/s12874-019-0792-y.

Authors

Alexandra Pomares-Quimbaya¹, Markus Kreuzthaler², Stefan Schulz²

Affiliations

¹ Pontificia Universidad Javeriana, Cra. 7 No 40-62, Bogotá, 110231, Colombia. pomares@javeriana.edu.co.
² Institute for Medical Informatics, Statistics and Documentation, Medical University of Graz, Auenbruggerplatz 2, Graz, 8036, Austria.

PMID: 31319802
PMCID: PMC6637496
DOI: 10.1186/s12874-019-0792-y

Abstract

Background: The identification of sections in narrative content of Electronic Health Records (EHR) has demonstrated to improve the performance of clinical extraction tasks; however, there is not yet a shared understanding of the concept and its existing methods. The objective is to report the results of a systematic review concerning approaches aimed at identifying sections in narrative content of EHR, using both automatic or semi-automatic methods.

Methods: This review includes articles from the databases: SCOPUS, Web of Science and PubMed (from January 1994 to September 2018). The selection of studies was done using predefined eligibility criteria and applying the PRISMA recommendations. Search criteria were elaborated by using an iterative and collaborative keyword enrichment.

Results: Following the eligibility criteria, 39 studies were selected for analysis. The section identification approaches proposed by these studies vary greatly depending on the kind of narrative, the type of section, and the application. We observed that 57% of them proposed formal methods for identifying sections and 43% adapted a previously created method. Seventy-eight percent were intended for English texts and 41% for discharge summaries. Studies that are able to identify explicit (with headings) and implicit sections correspond to 46%. Regarding the level of granularity, 54% of the studies are able to identify sections, but not subsections. From the technical point of view, the methods can be classified into rule-based methods (59%), machine learning methods (22%) and a combination of both (19%). Hybrid methods showed better results than those relying on pure machine learning approaches, but lower than rule-based methods; however, their scope was more ambitious than the latter ones. Despite all the promising performance results, very few studies reported tests under a formal setup. Almost all the studies relied on custom dictionaries; however, they used them in conjunction with a controlled terminology, most commonly the UMLSⓇ metathesaurus.

Conclusions: Identification of sections in EHR narratives is gaining popularity for improving clinical extraction projects. This study enabled the community working on clinical NLP to gain a formal analysis of this task, including the most successful ways to perform it.

Keywords: Clinical narrative; Electronic health record; Free text; Machine learning; Natural language processing; Section identification.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

**Fig. 2**
Distribution of performance results

**Fig. 3**
Individual performance results

See this image and copyright information in PMC

References

1. Apostolova E, Channin DS, Demner-Fushman D, Furst J, Lytinen S, Raicu D. Conf Proc IEEE Eng Med Biol Soc.: 5905-8. New York: IEEE; 2009. Automatic segmentation of clinical texts. - PubMed
1. Bodenreider O. The unified medical language system (umls): integrating biomedical terminology. Nucleic Acids Res. 2004;32(suppl 1):D267–70. - PMC - PubMed
1. Bramsen P, Deshpande P, Lee YK, Barzilay R. Finding temporal order in discharge summaries. In: AMIA Annual Symposium. USA: American Medical Informatics Association: 2006. - PMC - PubMed
1. Chapman WW, Savova GK, Zheng J, Tharp M, Crowley R. Anaphoric reference in clinical reports: characteristics of an annotated corpus. J Biomed Inform. 2012;45(3):507–21. - PubMed
1. Chen C, Chang N, Chang Y, Dai H. Section heading recognition in electronic health records using conditional random fields. In: TAAI, volume 8916 of LNCS. Springer: 2014. p. 47–55.

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Current approaches to identify sections within clinical narratives from electronic health records: a systematic review

Affiliations

Current approaches to identify sections within clinical narratives from electronic health records: a systematic review

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources

Other Literature Sources