Recognizing obesity and comorbidities in sparse data

Ozlem Uzuner¹

Affiliations

PMID: 19390096
PMCID: PMC2705260
DOI: 10.1197/jamia.M3115

Recognizing obesity and comorbidities in sparse data

Ozlem Uzuner. J Am Med Inform Assoc. 2009 Jul-Aug.

. 2009 Jul-Aug;16(4):561-70.

doi: 10.1197/jamia.M3115. Epub 2009 Apr 23.

Author

Ozlem Uzuner¹

Affiliation

¹ University at Albany, SUNY, Albany, NY, USA. ouzuner@albany.edu

PMID: 19390096
PMCID: PMC2705260
DOI: 10.1197/jamia.M3115

Abstract

In order to survey, facilitate, and evaluate studies of medical language processing on clinical narratives, i2b2 (Informatics for Integrating Biology to the Bedside) organized its second challenge and workshop. This challenge focused on automatically extracting information on obesity and fifteen of its most common comorbidities from patient discharge summaries. For each patient, obesity and any of the comorbidities could be Present, Absent, or Questionable (i.e., possible) in the patient, or Unmentioned in the discharge summary of the patient. i2b2 provided data for, and invited the development of, automated systems that can classify obesity and its comorbidities into these four classes based on individual discharge summaries. This article refers to obesity and comorbidities as diseases. It refers to the categories Present, Absent, Questionable, and Unmentioned as classes. The task of classifying obesity and its comorbidities is called the Obesity Challenge. The data released by i2b2 was annotated for textual judgments reflecting the explicitly reported information on diseases, and intuitive judgments reflecting medical professionals' reading of the information presented in discharge summaries. There were very few examples of some disease classes in the data. The Obesity Challenge paid particular attention to the performance of systems on these less well-represented classes. A total of 30 teams participated in the Obesity Challenge. Each team was allowed to submit two sets of up to three system runs for evaluation, resulting in a total of 136 submissions. The submissions represented a combination of rule-based and machine learning approaches. Evaluation of system runs shows that the best predictions of textual judgments come from systems that filter the potentially noisy portions of the narratives, project dictionaries of disease names onto the remaining text, apply negation extraction, and process the text through rules. Information on disease-related concepts, such as symptoms and medications, and general medical knowledge help systems infer intuitive judgments on the diseases.

PubMed Disclaimer

References

1. Van Ginneken A, De Wilde M, Van Mulligen E, Stam H. Can data representation and interface demands be reconciled?. Approach in orca. AMIA Annu Symp Proc 1997:779-783. - PMC - PubMed
1. Friedman C, Alderson PO, Austin JH, Cimino JJ, Johnson SB. A general natural-language text processor for clinical radiology J Am Med Inform Assoc 1994;1(2):161-174. - PMC - PubMed
1. Christakis NA, Fowler JH. The spread of obesity in a large social network over 32 years N Engl J Med 2007;357(4):370-379Jul. - PubMed
1. Friedman C, Hripcsak G, Shablinsky I. An evaluation of natural language processing methodologiesAMIA Annu Symp Proc; 1998. pp. 855-859. - PMC - PubMed
1. Grishman R, Sundheim B. Message Understanding Conference-6: A brief history16^th Conference on Computational Linguistics, COLING; 1996. pp. 466-471.

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database
Medical
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Recognizing obesity and comorbidities in sparse data

Affiliation

Recognizing obesity and comorbidities in sparse data

Author

Affiliation

Abstract

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources

Medical