A General Propensity Score for Signal Identification Using Tree-Based Scan Statistics
- PMID: 33615330
- DOI: 10.1093/aje/kwab034
A General Propensity Score for Signal Identification Using Tree-Based Scan Statistics
Abstract
The tree-based scan statistic (TreeScan; Martin Kulldorff, Harvard Medical School, Boston, Massachusetts) is a data-mining method that adjusts for multiple testing of correlated hypotheses when screening thousands of potential adverse events for signal identification. Simulation has demonstrated the promise of TreeScan with a propensity score (PS)-matched cohort design. However, it is unclear which variables to include in a PS for applied signal identification studies to simultaneously adjust for confounding across potential outcomes. We selected 4 pairs of medications with well-understood safety profiles. For each pair, we evaluated 5 candidate PSs with different combinations of 1) predefined general covariates (comorbidity, frailty, utilization), 2) empirically selected (data-driven) covariates, and 3) covariates tailored to the drug pair. For each pair, statistical alerting patterns were similar with alternative PSs (≤11 alerts in 7,996 outcomes scanned). Inclusion of covariates tailored to exposure did not appreciably affect screening results. Inclusion of empirically selected covariates can provide better proxy coverage for confounders but can also decrease statistical power. Unlike tailored covariates, empirical and predefined general covariates can be applied "out of the box" for signal identification. The choice of PS depends on the level of concern about residual confounding versus loss of power. Potential signals should be followed by pharmacoepidemiologic assessment where confounding control is tailored to the specific outcome(s) under investigation.
Keywords: TreeScan; propensity score; real-world data; signal identification.
Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health 2021. This work is written by (a) US Government employee(s) and is in the public domain in the US.
Similar articles
-
Data Mining for Adverse Drug Events With a Propensity Score-matched Tree-based Scan Statistic.Epidemiology. 2018 Nov;29(6):895-903. doi: 10.1097/EDE.0000000000000907. Epidemiology. 2018. PMID: 30074538 Free PMC article.
-
Active Surveillance of the Safety of Medications Used During Pregnancy.Am J Epidemiol. 2021 Jun 1;190(6):1159-1168. doi: 10.1093/aje/kwaa288. Am J Epidemiol. 2021. PMID: 33423046
-
Drug safety data mining with a tree-based scan statistic.Pharmacoepidemiol Drug Saf. 2013 May;22(5):517-23. doi: 10.1002/pds.3423. Epub 2013 Mar 20. Pharmacoepidemiol Drug Saf. 2013. PMID: 23512870
-
[Review on tree-based scan statistic in drug and vaccine safety monitoring].Zhonghua Liu Xing Bing Xue Za Zhi. 2021 Jul 10;42(7):1286-1291. doi: 10.3760/cma.j.cn112338-20201103-01297. Zhonghua Liu Xing Bing Xue Za Zhi. 2021. PMID: 34814545 Review. Chinese.
-
Indications for propensity scores and review of their use in pharmacoepidemiology.Basic Clin Pharmacol Toxicol. 2006 Mar;98(3):253-9. doi: 10.1111/j.1742-7843.2006.pto_293.x. Basic Clin Pharmacol Toxicol. 2006. PMID: 16611199 Free PMC article. Review.
Cited by
-
High-Throughput Computing to Automate Population-Based Studies to Detect the 30-Day Risk of Adverse Outcomes After New Outpatient Medication Use in Older Adults with Chronic Kidney Disease: A Clinical Research Protocol.Can J Kidney Health Dis. 2024 Jan 6;11:20543581231221891. doi: 10.1177/20543581231221891. eCollection 2024. Can J Kidney Health Dis. 2024. PMID: 38186562 Free PMC article.
-
Interplay of Spontaneous Reporting and Longitudinal Healthcare Databases for Signal Management: Position Statement from the Real-World Evidence and Big Data Special Interest Group of the International Society of Pharmacovigilance.Drug Saf. 2025 Sep;48(9):959-976. doi: 10.1007/s40264-025-01548-3. Epub 2025 Apr 13. Drug Saf. 2025. PMID: 40223041 Free PMC article.
-
Applying Machine Learning in Distributed Data Networks for Pharmacoepidemiologic and Pharmacovigilance Studies: Opportunities, Challenges, and Considerations.Drug Saf. 2022 May;45(5):493-510. doi: 10.1007/s40264-022-01158-3. Epub 2022 May 17. Drug Saf. 2022. PMID: 35579813 Free PMC article. Review.
-
Broadening the reach of the FDA Sentinel system: A roadmap for integrating electronic health record data in a causal analysis framework.NPJ Digit Med. 2021 Dec 20;4(1):170. doi: 10.1038/s41746-021-00542-0. NPJ Digit Med. 2021. PMID: 34931012 Free PMC article. Review.
-
Investigation of the potential association between the use of fluoxetine and occurrence of acute pancreatitis: a Danish register-based cohort study.Int J Epidemiol. 2022 Oct 13;51(5):1656-1665. doi: 10.1093/ije/dyac071. Int J Epidemiol. 2022. PMID: 35472246 Free PMC article.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources