Creation of a new longitudinal corpus of clinical narratives
- PMID: 26433122
- PMCID: PMC4978168
- DOI: 10.1016/j.jbi.2015.09.018
Creation of a new longitudinal corpus of clinical narratives
Abstract
The 2014 i2b2/UTHealth Natural Language Processing (NLP) shared task featured a new longitudinal corpus of 1304 records representing 296 diabetic patients. The corpus contains three cohorts: patients who have a diagnosis of coronary artery disease (CAD) in their first record, and continue to have it in subsequent records; patients who do not have a diagnosis of CAD in the first record, but develop it by the last record; patients who do not have a diagnosis of CAD in any record. This paper details the process used to select records for this corpus and provides an overview of novel research uses for this corpus. This corpus is the only annotated corpus of longitudinal clinical narratives currently available for research to the general research community.
Keywords: Corpus; Machine learning; Medical records; NLP.
Copyright © 2015 Elsevier Inc. All rights reserved.
Figures
References
-
- Hersh William, Buckley Chris, Leone TJ, Hickam David. OHSUMED: an interactive retrieval evaluation and new large test collection for research. In: Bruce Croft W, van Rijsbergen CJ, editors. Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval (SIGIR '94) Springer-Verlag New York, Inc.; New York, NY, USA: 1994. pp. 192–201.
-
- Yeh Alexander, Hirschman Lynette, Morgan Alexander. Background and overview for KDD Cup 2002 task 1: information extraction from biomedical articles. SIGKDD Explor. Newsl. 2002 2002 Dec;4(2):87–89. DOI=10.1145/772862.772873 http://doi.acm.org/10.1145/772862.772873. - DOI
-
- Hersh William, Voorhees Ellen. TREC genomics special issue overview. Information Retrieval. 2008;12:1–15.
-
- Clifford GD, Scott DJ, Villarroel M. User Guide and Documentation for the MIMIC II Database 2012, database version 2.6. available online: https://mimic.physionet.org/UserGuide/UserGuide.html.
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical
Miscellaneous