Generalizable Natural Language Processing Framework for Migraine Reporting from Social Media

Affiliations

¹ Department of Biomedical Informatics, Emory University, Atlanta, GA.
² Department of Computer Science, NSUT, India.
³ Department of Neurology, Mayo Clinic, Rochester, MN.
⁴ Department of Pharmacy Services, Mayo Clinic Health System, Austin, MN.
⁵ Department of Neurology, Mayo Clinic, Scottsdale, AZ.
⁶ Department of Psychology, University of North Texas, TX.
⁷ Department of Cardiology, Mayo Clinic, Rochester, MN.
⁸ Department of Radiology, Mayo Clinic, Rochester, AZ.
⁹ Department of Computer Science, Arizona State University, Tempe, AZ.

PMID: 37350878
PMCID: PMC10283091

Generalizable Natural Language Processing Framework for Migraine Reporting from Social Media

Yuting Guo et al. AMIA Jt Summits Transl Sci Proc. 2023.

. 2023 Jun 16:2023:261-270.

eCollection 2023.

Authors

Affiliations

¹ Department of Biomedical Informatics, Emory University, Atlanta, GA.
² Department of Computer Science, NSUT, India.
³ Department of Neurology, Mayo Clinic, Rochester, MN.
⁴ Department of Pharmacy Services, Mayo Clinic Health System, Austin, MN.
⁵ Department of Neurology, Mayo Clinic, Scottsdale, AZ.
⁶ Department of Psychology, University of North Texas, TX.
⁷ Department of Cardiology, Mayo Clinic, Rochester, MN.
⁸ Department of Radiology, Mayo Clinic, Rochester, AZ.
⁹ Department of Computer Science, Arizona State University, Tempe, AZ.

PMID: 37350878
PMCID: PMC10283091

Abstract

Migraine is a highly prevalent and disabling neurological disorder. However, information about migraine management in real-world settings is limited to traditional health information sources. In this paper, we (i) verify that there is substantial migraine-related chatter available on social media (Twitter and Reddit), self-reported by those with migraine; (ii) develop a platform-independent text classification system for automatically detecting self-reported migraine-related posts, and (iii) conduct analyses of the self-reported posts to assess the utility of social media for studying this problem. We manually annotated 5750 Twitter posts and 302 Reddit posts, and used them for training and evaluating supervised machine learning methods. Our best system achieved an F₁ score of 0.90 on Twitter and 0.93 on Reddit. Analysis of information posted by our 'migraine cohort' revealed the presence of a plethora of relevant information about migraine therapies and sentiments associated with them. Our study forms the foundation for conducting an in-depth analysis of migraine-related information using social media data.

PubMed Disclaimer

Figures

**Figure 1.**
The framework of our generalizable NLP system—model development and validation on Twitter data followed by additional evaluation on Reddit posts.

**Figure 2.**
Examples from the bias analysis. The green color signifies positive attention to the words while red shows negative.

**Figure 3.**
The normalized sentiment distributions of the medications: (a) Twitter and (b) Reddit.

See this image and copyright information in PMC

References

1. Nittas V, Lun P, Ehrler F, Puhan MA, Mütsch M. Electronic Patient-Generated Health Data to Facilitate Disease Prevention and Health Promotion: Scoping Review. J Med Internet Res. 2019;21(10):e13320. doi:10.2196/13320. - PMC - PubMed
1. Conway M, Hu M, Chapman WW. Recent Advances in Using Natural Language Processing to Address Public Health Research Questions Using Social Media and ConsumerGenerated Data. Yearbook of medical informatics. 2019;28(1):208–217. doi:10.1055/s-0039-1677918. - PMC - PubMed
1. Gonzalez-Hernandez G, Sarker A, O’Connor K, Savova G. Capturing the Patient’s Perspective: a Review of Advances in Natural Language Processing of Health-Related Text. Yearb Med Inform. 2017;26(1):214–227. - PMC - PubMed
1. Paul MJ, Sarker A, Brownstein JS, et al. Social Media Mining for Public Health Monitoring and Surveillance. Pacific Symposium on Biocomputing. World Scientific Publishing Co. Pte Ltd. 2016:468–479. doi:10.1142/9789814749411_0043.
1. Ravindranath S, Zhao C, Tgavalekos K. Patient Status Indicator to Extract Key Temporal Changes in Continuous-Time Deterioration Risk Score. Critical Care Medicine. 2021;49(1) https://journals.lww.com/ccmjournal/Fulltext/2021/01001/374_Patient_Stat... .

LinkOut - more resources

Full Text Sources
- Europe PubMed Central
- PubMed Central

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Generalizable Natural Language Processing Framework for Migraine Reporting from Social Media

Affiliations

Generalizable Natural Language Processing Framework for Migraine Reporting from Social Media

Authors

Affiliations

Abstract

Figures

References

LinkOut - more resources

Full Text Sources