Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Apr 11;10(1):6.
doi: 10.1186/s13326-019-0198-0.

Moonstone: a novel natural language processing system for inferring social risk from clinical narratives

Affiliations

Moonstone: a novel natural language processing system for inferring social risk from clinical narratives

Mike Conway et al. J Biomed Semantics. .

Abstract

Background: Social risk factors are important dimensions of health and are linked to access to care, quality of life, health outcomes and life expectancy. However, in the Electronic Health Record, data related to many social risk factors are primarily recorded in free-text clinical notes, rather than as more readily computable structured data, and hence cannot currently be easily incorporated into automated assessments of health. In this paper, we present Moonstone, a new, highly configurable rule-based clinical natural language processing system designed to automatically extract information that requires inferencing from clinical notes. Our initial use case for the tool is focused on the automatic extraction of social risk factor information - in this case, housing situation, living alone, and social support - from clinical notes. Nursing notes, social work notes, emergency room physician notes, primary care notes, hospital admission notes, and discharge summaries, all derived from the Veterans Health Administration, were used for algorithm development and evaluation.

Results: An evaluation of Moonstone demonstrated that the system is highly accurate in extracting and classifying the three variables of interest (housing situation, living alone, and social support). The system achieved positive predictive value (i.e. precision) scores ranging from 0.66 (homeless/marginally housed) to 0.98 (lives at home/not homeless), accuracy scores ranging from 0.63 (lives in facility) to 0.95 (lives alone), and sensitivity (i.e. recall) scores ranging from 0.75 (lives in facility) to 0.97 (lives alone).

Conclusions: The Moonstone system is - to the best of our knowledge - the first freely available, open source natural language processing system designed to extract social risk factors from clinical text with good (lives in facility) to excellent (lives alone) performance. Although developed with the social risk factor identification task in mind, Moonstone provides a powerful tool to address a range of clinical natural language processing tasks, especially those tasks that require nuanced linguistic processing in conjunction with inference capabilities.

Keywords: Natural language processing; Social determinants of health; Software.

PubMed Disclaimer

Conflict of interest statement

Ethics approval and consent to participate

The study was approved by the University of Utah’s Institutional Review Board (IRB_00070714).

Consent for publication

Not applicable.

Competing interests

All authors declare that they have no competing interest.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Figures

Fig. 1
Fig. 1
System architecture. Ovals (green) are knowledge resources, and rectangles (blue) are Moonstone components
Fig. 2
Fig. 2
TSL example
Fig. 3
Fig. 3
Semantic grammar rule
Fig. 4
Fig. 4
Word-level grammar rule mapping phrases to normalizing constant “:SPOUSE:”. Note that these rules were defined for the US healthcare system and hence may not prove appropriate for non-US contexts
Fig. 5
Fig. 5
TSL rule used to augment grammatical analysis
Fig. 6
Fig. 6
Parse tree for “patient lives with his wife at home”

References

    1. Ye Y, Tsui FR, Wagner M, Espino JU, Li Q. Influenza detection from emergency department reports using natural language processing and Bayesian network classifiers. J Am Med Inform Assoc. 2014;5(21):815–23. doi: 10.1136/amiajnl-2013-001934. - DOI - PMC - PubMed
    1. Reblin M, Uchino BN. Social and emotional support and its implication for health. Curr Opin Psychiatry. 2008;2(21):201–5. doi: 10.1097/YCO.0b013e3282f3ad89. - DOI - PMC - PubMed
    1. Weinreich M, Nguyen OK, Wang D, Mayo H, Mortensen EM, Halm EA, Makam AN. Predicting the risk of readmission in pneumonia. a systematic review of model performance. Ann Am Thorac Soc. 2016;9(13):1607–14. doi: 10.1513/AnnalsATS.201602-135SR. - DOI - PMC - PubMed
    1. Calvillo-King L, Arnold D, Eubank K, Lo M, Yunyongying P, Stieglitz H. Halm E: Impact of social factors on risk of readmission or mortality in pneumonia and heart failure: systematic review. J Gen Intern Med. 2013;2(28):269–82. doi: 10.1007/s11606-012-2235-x. - DOI - PMC - PubMed
    1. National Quality Forum: Disparities in Healthcare and Health Outcomes in Selected Conditions. Tech rep. 2017. www.qualityforum.org/Projects/c-d/Disparities/Final_Report.aspx.

Publication types