J Biomed Inform. 2019 Feb:90:103091.

doi: 10.1016/j.jbi.2018.12.005. Epub 2019 Jan 4.

A systematic approach for developing a corpus of patient reported adverse drug events: A case study for SSRI and SNRI medications

Affiliations

¹ Department of Health Sciences, University of Wisconsin Milwaukee, Milwaukee, WI, United States; Lister Hill National Center for Biomedical Communications, National Library of Medicine, National Institutes of Health, Bethesda, MD, United States; Section of Medical Informatics, Department of Health Science Research, Mayo Clinic, Rochester, MN, United States. Electronic address: Zolnoori.Maryam@Mayo.edu.
² Lister Hill National Center for Biomedical Communications, National Library of Medicine, National Institutes of Health, Bethesda, MD, United States. Electronic address: kfung@mail.nih.gov.
³ Department of Health Sciences, University of Wisconsin Milwaukee, Milwaukee, WI, United States.
⁴ Lister Hill National Center for Biomedical Communications, National Library of Medicine, National Institutes of Health, Bethesda, MD, United States. Electronic address: pfontelo@mail.nih.gov.
⁵ Department of Health Policy and Management, Johns Hopkins University, Baltimore, MD, United States.
⁶ Department of Biomedical and Health Information Sciences, University of Illinois at Chicago, Chicago, IL, United States.
⁷ School of Pharmacy, University of Pittsburgh, Pittsburgh, PA, United States.
⁸ School of Information, University of South Florida, Tampa, FL, United States.
⁹ Department of Biomedical Informatics, Utah University, Salt Lake City, United States.
¹⁰ Emmes Corporation, Rockville, MD, United States.
¹¹ Department of Epidemiology, Johns Hopkins University, Baltimore, MD, United States.
¹² College of Letters and Science, University of Wisconsin Milwaukee, WI, United States.
¹³ School of Computing and Engineering, University of Missouri-Kansas, Kansas City, MO, United States.

PMID: 30611893
PMCID: PMC12139017
DOI: 10.1016/j.jbi.2018.12.005

Maryam Zolnoori et al. J Biomed Inform. 2019 Feb.

Abstract

The "Psychiatric Treatment Adverse Reactions" (PsyTAR) corpus is an annotated corpus developed from patients' narrative data on psychiatric medications, particularly SSRIs (selective serotonin reuptake inhibitors) and SNRIs (serotonin norepinephrine reuptake inhibitors). The corpus consists of three main components: sentence classification, entity identification, and entity normalization. We split the review posts into sentences and labeled them for the presence of adverse drug reactions (ADRs) (2168 sentences), withdrawal symptoms (WDs) (438 sentences), signs/symptoms/illnesses (SSIs) (789 sentences), drug indications (DIs) (517 sentences), drug effectiveness (EF) (1087 sentences), and drug ineffectiveness (INF) (337 sentences). In the entity identification phase, we identified and extracted ADRs (4813 mentions), WDs (590 mentions), SSIs (1219 mentions), and DIs (792 mentions). In the entity normalization phase, we mapped the identified entities to the corresponding concepts in both UMLS (918 unique concepts) and SNOMED CT (755 unique concepts). Four annotators double-coded the sentences and the spans of the identified entities, strictly following the guideline rules developed for this study. We used the PsyTAR sentence classification component to train a range of supervised machine learning classifiers to automatically identify text segments that mention ADRs, WDs, DIs, SSIs, EF, and INF. SVM classifiers had the highest performance, with an F-score of 0.90. We also measured the performance of cTAKES (clinical Text Analysis and Knowledge Extraction System) in identifying patients' expressions of ADRs and WDs with and without adding the PsyTAR dictionary to the core dictionary of cTAKES. Augmenting the cTAKES dictionary with PsyTAR improved the F-score of cTAKES by 25%.
The findings suggest that PsyTAR has significant implications for text mining algorithms that aim to identify information about adverse drug events and drug effectiveness from patients' narrative data, because it links patients' expressions of adverse drug events to standard medical vocabularies. The corpus is publicly available at Zolnoori et al. [30].
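
The dictionary augmentation experiment described above can be illustrated with a minimal sketch. It is not the authors' pipeline and does not use cTAKES: it assumes a plain substring lookup, and the dictionaries, concept labels, and sentences are hypothetical toy values rather than PsyTAR data or real UMLS identifiers. It only shows why adding lay expressions to a clinical term dictionary can raise recall, and with it the F-score.

```python
def find_mentions(sentence, dictionary):
    """Return the concept labels whose surface phrase occurs in the sentence."""
    text = sentence.lower()
    return {concept for phrase, concept in dictionary.items() if phrase in text}


def f_score(predicted, gold):
    """F1 over (sentence index, concept) pairs."""
    true_positives = len(predicted & gold)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(predicted)
    recall = true_positives / len(gold)
    return 2 * precision * recall / (precision + recall)


# Hypothetical core dictionary of clinical terms (placeholder concept labels).
CORE = {"insomnia": "CONCEPT_INSOMNIA", "nausea": "CONCEPT_NAUSEA"}

# Hypothetical lay expressions of the kind a patient corpus could contribute.
LAY = {"can't sleep": "CONCEPT_INSOMNIA", "sick to my stomach": "CONCEPT_NAUSEA"}

# Toy sentences with hand-assigned gold concepts; not drawn from PsyTAR.
SENTENCES = [
    ("I can't sleep since starting this drug.", {"CONCEPT_INSOMNIA"}),
    ("Felt sick to my stomach every morning.", {"CONCEPT_NAUSEA"}),
    ("Mild nausea in the first week.", {"CONCEPT_NAUSEA"}),
]


def evaluate(dictionary):
    """Score dictionary lookup against the gold concepts of the toy sentences."""
    predicted, gold = set(), set()
    for index, (sentence, concepts) in enumerate(SENTENCES):
        predicted |= {(index, c) for c in find_mentions(sentence, dictionary)}
        gold |= {(index, c) for c in concepts}
    return f_score(predicted, gold)


core_f1 = evaluate(CORE)
augmented_f1 = evaluate({**CORE, **LAY})
print(f"core: {core_f1:.2f}, augmented: {augmented_f1:.2f}")
```

On these toy sentences the core dictionary finds only the one mention phrased in clinical vocabulary, while the augmented dictionary also matches the two lay expressions. The size of the gain here is an artifact of the toy data and says nothing about the 25% improvement reported for cTAKES.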

Keywords: Adverse drug events; Annotated corpus; Drug effectiveness; Drug safety; Information extraction; Machine learning; Online healthcare forums; Patients narratives; Psychiatric medications; SNOMED CT; SNRIs; SSRIs; Semantic mapping; Social media; Text mining; UMLS.

Conflict of interest statement

We have no conflict of interest to declare.

Figures

Flowchart of Finding Proper Concept for Layperson’s Expression of Medical Entities

**Fig. 1.**
Methodology for developing the corpus. API: Application Programming Interface; ADR: Adverse Drug Reaction; WD: Withdrawal Symptoms; IAA: Inter-Annotator-Agreement; cTAKES: Clinical Text Analysis and Knowledge Extraction System.

**Fig. 2.**
PsyTAR pipeline for automatization of text segment classification and identification of patients’ expression of pharmacological effects associated with Psychiatric medications.

**Fig. 3.**
Performance of cTAKES with and without PsyTAR dictionary for identifying ADRs and WDs from patient drug reviews.

References

1. Aronson J, Bottled lightning, BMJ 331 (7520) (2005) 824.
2. Benton A, Ungar L, Hill S, Hennessy S, Mao J, Chung A, Holmes JH, Identifying potential adverse effects using the web: a new approach to medical hypothesis generation, J. Biomed. Inform 44 (6) (2011) 989–996.
3. Charan J, Biswas T, How to calculate sample size for different study designs in medical research? Indian J. Psychol. Med 35 (2) (2013) 121–126, 10.4103/0253-7176.116232.
4. Golder S, Norman G, Loke YK, Systematic review on the prevalence, frequency and comparative value of adverse events data in social media, Br. J. Clin. Pharmacol 80 (4) (2015) 878–888.
5. Gurulingappa H, Klinger R, Hofmann-Apitius M, Fluck J, An empirical evaluation of resources for the identification of diseases and adverse effects in biomedical literature. Paper presented at the 2nd Workshop on Building and evaluating resources for biomedical text mining (7th edition of the Language Resources and Evaluation Conference), 2010.

Grants and funding

Z99 LM999999/ImNIH/Intramural NIH HHS/United States
