Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Jul 2;4(3):e78.
doi: 10.2196/resprot.3433.

Collecting and Analyzing Patient Experiences of Health Care From Social Media

Affiliations

Collecting and Analyzing Patient Experiences of Health Care From Social Media

Majid Rastegar-Mojarad et al. JMIR Res Protoc. .

Abstract

Background: Social Media, such as Yelp, provides rich information of consumer experience. Previous studies suggest that Yelp can serve as a new source to study patient experience. However, the lack of a corpus of patient reviews causes a major bottleneck for applying computational techniques.

Objective: The objective of this study is to create a corpus of patient experience (COPE) and report descriptive statistics to characterize COPE.

Methods: Yelp reviews about health care-related businesses were extracted from the Yelp Academic Dataset. Natural language processing (NLP) tools were used to split reviews into sentences, extract noun phrases and adjectives from each sentence, and generate parse trees and dependency trees for each sentence. Sentiment analysis techniques and Hadoop were used to calculate a sentiment score of each sentence and for parallel processing, respectively.

Results: COPE contains 79,173 sentences from 6914 patient reviews of 985 health care facilities near 30 universities in the United States. We found that patients wrote longer reviews when they rated the facility poorly (1 or 2 stars). We demonstrated that the computed sentiment scores correlated well with consumer-generated ratings. A consumer vocabulary to describe their health care experience was constructed by a statistical analysis of word counts and co-occurrences in COPE.

Conclusions: A corpus called COPE was built as an initial step to utilize social media to understand patient experiences at health care facilities. The corpus is available to download and COPE can be used in future studies to extract knowledge of patients' experiences from their perspectives. Such information can subsequently inform and provide opportunity to improve the quality of health care.

Keywords: consumer health information; health care; natural language processing; patient satisfaction; social media.

PubMed Disclaimer

Conflict of interest statement

Conflicts of Interest: None declared.

Figures

Figure 1
Figure 1
Part-of-speech analysis conducted on each sentence in COPE.
Figure 2
Figure 2
Number of reviews per years.
Figure 3
Figure 3
Distribution of reviews.
Figure 4
Figure 4
Distribution of review length.
Figure 5
Figure 5
Number of sentences per review.
Figure 6
Figure 6
Distribution of the rating scores per review.
Figure 7
Figure 7
Length of review versus rating score.
Figure 8
Figure 8
Distribution of the sentiment score per sentence.
Figure 9
Figure 9
Mean sentiment score of sentences in each review per patient-generated overall rating.
Figure 10
Figure 10
Sentiment score per length of sentence.
Figure 11
Figure 11
Cumulative usage of terms versus rank of terms.
Figure 12
Figure 12
A network of words used by customers to describe their experiences. The size of the node indicates the frequency of the word and the width of the lines indicates the number of co-occurrences of the word-pair in the same review. An example of usage of the word “platelet” is shown in the call-out box.

References

    1. Hanauer D, Zheng K, Singer D, Gebremariam A, Davis M. PUblic awareness, perception, and use of online physician rating sites. JAMA. 2014 Feb - PubMed
    1. PwC Scoring healthcare: Navigating customer experience ratings. [2014-03-07]. http://www.pwc.com/us/en/health-industries/publications/scoring-patient-... .
    1. Keckley P, Coughlin S, Eselius L. 2011 Survey of Health Care Consumers in the United States: Key Findings, Strategic Implications. Washington, DC: Deloitte Center for Health Solutions; 2011.
    1. Shoemaker P. What value-based purchasing means to your hospital. Healthc Financ Manage. 2011 Aug;65(8):60–8. - PubMed
    1. Zusman EE. HCAHPS replaces Press Ganey survey as quality measure for patient hospital experience. Neurosurgery. 2012 Aug;71(2):N21–4. doi: 10.1227/01.neu.0000417536.07871.ed.00006123-201208000-00006 - DOI - PubMed

LinkOut - more resources