Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012;5(Suppl. 1):43-50.
doi: 10.4137/BII.S8961. Epub 2012 Jan 30.

A hybrid approach to sentiment sentence classification in suicide notes

Affiliations

A hybrid approach to sentiment sentence classification in suicide notes

Sunghwan Sohn et al. Biomed Inform Insights. 2012.

Abstract

This paper describes the sentiment classification system developed by the Mayo Clinic team for the 2011 I2B2/VA/Cincinnati Natural Language Processing (NLP) Challenge. The sentiment classification task is to assign any pertinent emotion to each sentence in suicide notes. We have implemented three systems that have been trained on suicide notes provided by the I2B2 challenge organizer-a machine learning system, a rule-based system, and a system consisting of a combination of both. Our machine learning system was trained on re-annotated data in which apparently inconsistent emotion assignment was adjusted. Then, the machine learning methods by RIPPER and multinomial Naïve Bayes classifiers, manual pattern matching rules, and the combination of the two systems were tested to determine the emotions within sentences. The combination of the machine learning and rule-based system performed best and produced a micro-average F-score of 0.5640.

Keywords: machine learning; natural language processing; sentiment classification; suicidal emotion.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Statistics of 600 suicide notes in the training set.
Figure 2.
Figure 2.
Summary of pattern matching rules.

References

    1. O’Connor R, Sheehy N. Understanding suicidal behaviour. Leicester: BPS Books; 2000.
    1. Sussman M, Jones S, Wilson T, Kann L. The Youth Risk Behavior Surveillance System: updating policy and program applications. J Sch Health. 2002;72(1):13–7. - PubMed
    1. Pestian J, Nasrallah H, Matykiewicz P, Bennett A, Leenaars A. Suicide Note Classification Using Natural Language Processing: A Content Analysis. Biomedical Informatics Insights. 2010;3:19–28. - PMC - PubMed
    1. Matykiewicz P, Duch W, Pestian J. Clustering semantic spaces of suicide notes and newsgroups articles. BioNLP ′09 Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing; 2009.
    1. Huang Y-P, Goh T, Li C. Hunting Suicide Notes in Web 20 – Preliminary Findings. Ninth IEEE International Symposium on Multimedia 2007— Workshops; 2007. pp. 517–21.

LinkOut - more resources