Review

JMIR Med Inform. 2020 Jan 28;8(1):e16023.

doi: 10.2196/16023.

Sentiment Analysis in Health and Well-Being: Systematic Review

Anastazia Zunic¹, Padraig Corcoran¹, Irena Spasic¹

PMID: 32012057
PMCID: PMC7013658
DOI: 10.2196/16023

Anastazia Zunic et al. JMIR Med Inform. 2020.

Affiliation

¹ School of Computer Science & Informatics, Cardiff University, Cardiff, United Kingdom.

Abstract

Background: Sentiment analysis (SA) is a subfield of natural language processing whose aim is to automatically classify the sentiment expressed in a free text. It has found practical applications across a wide range of societal contexts including marketing, economy, and politics. This review focuses specifically on applications related to health, which is defined as "a state of complete physical, mental, and social well-being and not merely the absence of disease or infirmity."

Objective: This study aimed to establish the state of the art in SA related to health and well-being by conducting a systematic review of the recent literature. To capture the perspective of those individuals whose health and well-being are affected, we focused specifically on spontaneously generated content and not necessarily that of health care professionals.

Methods: Our methodology is based on the guidelines for performing systematic reviews. In January 2019, we used PubMed, a multifaceted interface, to perform a literature search against MEDLINE. We identified a total of 86 relevant studies and extracted data about the datasets analyzed, discourse topics, data creators, downstream applications, algorithms used, and their evaluation.

Results: The majority of data were collected from social networking and Web-based retailing platforms. The primary purpose of these online conversations is to exchange information and provide social support. These communities tend to form around health conditions with high severity and chronicity rates. Different treatments and services discussed include medications, vaccination, surgery, orthodontic services, individual physicians, and health care services in general. We identified 5 roles with respect to health and well-being among the authors of the types of spontaneously generated narratives considered in this review: a sufferer, an addict, a patient, a carer, and a suicide victim. Out of 86 studies considered, only 4 reported the demographic characteristics. A wide range of methods were used to perform SA. The most common choices included support vector machines, naïve Bayesian learning, decision trees, logistic regression, and adaptive boosting. In contrast with general trends in SA research, only 1 study used deep learning. Performance lags behind the state of the art achieved in other domains when measured by F-score, which was found to be below 60% on average. In the context of SA, the domain of health and well-being was found to be resource poor: few domain-specific corpora and lexica are shared publicly for research purposes.
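
The F-score used above to compare performance is conventionally the harmonic mean of precision and recall. The following is a minimal sketch of that calculation, not the evaluation code of any reviewed study; the function name `f_score` and the example counts are hypothetical and chosen only for illustration.

```python
def f_score(tp: int, fp: int, fn: int, beta: float = 1.0) -> float:
    """Return the F-beta score from true positive, false positive,
    and false negative counts (beta=1 gives the usual F1 score)."""
    # Precision: share of predicted positives that are correct.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    # Recall: share of actual positives that were found.
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision == 0.0 and recall == 0.0:
        return 0.0
    b2 = beta ** 2
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Hypothetical counts for a single sentiment class (not from the review).
example = f_score(tp=30, fp=20, fn=20)
print(f"F1 = {example:.2f}")
```

With these illustrative counts, precision and recall are both 0.6, so the F1 score is also 0.6, which is roughly the average level the review reports for this domain.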

Conclusions: SA results in the area of health and well-being lag behind those in other domains. It is not yet clear whether this is because of the intrinsic differences between the domains and their respective sublanguages, the size of training datasets, the lack of domain-specific sentiment lexica, or the choice of algorithms.

Keywords: machine learning; natural language processing; sentiment analysis; text mining.

©Anastazia Zunic, Padraig Corcoran, Irena Spasic. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 28.01.2020.

Conflict of interest statement

Conflicts of Interest: None declared.

Figures

**Figure 1**
Flow diagram of the literature review process.

**Figure 2**
The representation of the UMLS in sentiment lexica.

Cited by

Artificial Intelligence Surgery: How Do We Get to Autonomous Actions in Surgery?
Gumbs AA, Frigerio I, Spolverato G, Croner R, Illanes A, Chouillard E, Elyan E. Sensors (Basel). 2021 Aug 17;21(16):5526. doi: 10.3390/s21165526. PMID: 34450976. Free PMC article. Review.
A Tale of Two Cities: COVID-19 and the Emotional Well-Being of Student-Athletes Using Natural Language Processing.
Floyd C, Gulavani SS, Du J, Kim ACH, Pappas J. Front Sports Act Living. 2021 Aug 25;3:710289. doi: 10.3389/fspor.2021.710289. eCollection 2021. PMID: 34514388. Free PMC article.
Uncovering the Complexity of Perinatal Polysubstance Use Disclosure Patterns on X: Mixed Methods Study.
Wu D, Shead H, Ren Y, Raynor P, Tao Y, Villanueva H, Hung P, Li X, Brookshire RG, Eichelberger K, Guille C, Litwin AH, Olatosi B. J Med Internet Res. 2024 Sep 20;26:e53171. doi: 10.2196/53171. PMID: 39302713. Free PMC article.
Spontaneously Generated Online Patient Experience of Modafinil: A Qualitative and NLP Analysis.
Walsh J, Cave J, Griffiths F. Front Digit Health. 2021 Feb 17;3:598431. doi: 10.3389/fdgth.2021.598431. eCollection 2021. PMID: 34713085. Free PMC article.
The Patient Generated Index (PGI) as an early-warning system for predicting brain health challenges: a prospective cohort study for people living with Human Immunodeficiency Virus (HIV).
Humayun MM, Brouillette MJ, Fellows LK, Mayo NE. Qual Life Res. 2023 Dec;32(12):3439-3452. doi: 10.1007/s11136-023-03475-1. Epub 2023 Jul 10. PMID: 37428407.
