Dense Annotation of Free-Text Critical Care Discharge Summaries from an Indian Hospital and Associated Performance of a Clinical NLP Annotator

S V Ramanan¹, Kedar Radhakrishna², Abijeet Waghmare³, Tony Raj³, Senthil P Nathan⁴, Sai Madhukar Sreerama³, Sriram Sampath⁵

Affiliations

¹ RelAgent Technologies (P) Limited, IIT Madras Research Park, #14, 1st Floor, Taramani, Chennai, 600113, India. ramanan@relagent.com.
² Division of Medical Informatics, St. John's Research Institute, 100 Feet Road, Koramangala, Bangalore, 560034, India. kedar.angirus@gmail.com.
³ Division of Medical Informatics, St. John's Research Institute, 100 Feet Road, Koramangala, Bangalore, 560034, India.
⁴ RelAgent Technologies (P) Limited, IIT Madras Research Park, #14, 1st Floor, Taramani, Chennai, 600113, India.
⁵ Department of Critical Care Medicine, St. John's Medical College, Bangalore, India.

PMID: 27342107
DOI: 10.1007/s10916-016-0541-2

Dense Annotation of Free-Text Critical Care Discharge Summaries from an Indian Hospital and Associated Performance of a Clinical NLP Annotator

S V Ramanan et al. J Med Syst. 2016 Aug.

. 2016 Aug;40(8):187.

doi: 10.1007/s10916-016-0541-2. Epub 2016 Jun 24.

Authors

S V Ramanan¹, Kedar Radhakrishna², Abijeet Waghmare³, Tony Raj³, Senthil P Nathan⁴, Sai Madhukar Sreerama³, Sriram Sampath⁵

Affiliations

¹ RelAgent Technologies (P) Limited, IIT Madras Research Park, #14, 1st Floor, Taramani, Chennai, 600113, India. ramanan@relagent.com.
² Division of Medical Informatics, St. John's Research Institute, 100 Feet Road, Koramangala, Bangalore, 560034, India. kedar.angirus@gmail.com.
³ Division of Medical Informatics, St. John's Research Institute, 100 Feet Road, Koramangala, Bangalore, 560034, India.
⁴ RelAgent Technologies (P) Limited, IIT Madras Research Park, #14, 1st Floor, Taramani, Chennai, 600113, India.
⁵ Department of Critical Care Medicine, St. John's Medical College, Bangalore, India.

PMID: 27342107
DOI: 10.1007/s10916-016-0541-2

Abstract

Electronic Health Record (EHR) use in India is generally poor, and structured clinical information is mostly lacking. This work is the first attempt aimed at evaluating unstructured text mining for extracting relevant clinical information from Indian clinical records. We annotated a corpus of 250 discharge summaries from an Intensive Care Unit (ICU) in India, with markups for diseases, procedures, and lab parameters, their attributes, as well as key demographic information and administrative variables such as patient outcomes. In this process, we have constructed guidelines for an annotation scheme useful to clinicians in the Indian context. We evaluated the performance of an NLP engine, Cocoa, on a cohort of these Indian clinical records. We have produced an annotated corpus of roughly 90 thousand words, which to our knowledge is the first tagged clinical corpus from India. Cocoa was evaluated on a test corpus of 50 documents. The overlap F-scores across the major categories, namely disease/symptoms, procedures, laboratory parameters and outcomes, are 0.856, 0.834, 0.961 and 0.872 respectively. These results are competitive with results from recent shared tasks based on US records. The annotated corpus and associated results from the Cocoa engine indicate that unstructured text mining is a viable method for cohort analysis in the Indian clinical context, where structured EHR records are largely absent.

Keywords: Biomedical text extraction; Data mining; Discharge summary; Natural language processing; Text annotation.

PubMed Disclaimer

References

1. Stud Health Technol Inform. 1998;52 Pt 2:874-8 - PubMed
1. J Biomed Inform. 2014 Apr;48:54-65 - PubMed
1. J Am Med Inform Assoc. 2013 Sep-Oct;20(5):806-13 - PubMed
1. J Am Med Inform Assoc. 2010 Sep-Oct;17(5):519-23 - PubMed
1. Gastrointest Endosc. 2012 Jun;75(6):1233-9.e14 - PubMed

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
- Springer
Other Literature Sources
- scite Smart Citations
Medical
- MedlinePlus Health Information
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Dense Annotation of Free-Text Critical Care Discharge Summaries from an Indian Hospital and Associated Performance of a Clinical NLP Annotator

Affiliations

Dense Annotation of Free-Text Critical Care Discharge Summaries from an Indian Hospital and Associated Performance of a Clinical NLP Annotator

Authors

Affiliations

Abstract

References

MeSH terms

LinkOut - more resources

Full Text Sources

Other Literature Sources

Medical

Miscellaneous