Linking Individual Data From the Spinal Cord Injury Model Systems Center and Local Trauma Registry: Development and Validation of Probabilistic Matching Algorithm
- PMID: 33536727
- PMCID: PMC7831288
- DOI: 10.46292/sci20-00015
Abstract
Background: Linking records from the National Spinal Cord Injury Model Systems (SCIMS) database to the National Trauma Data Bank (NTDB) provides a unique opportunity to study early variables in predicting long-term outcomes after traumatic spinal cord injury (SCI). The public use data sets of SCIMS and NTDB are stripped of protected health information, including dates and zip code.
Objectives: To develop and validate a probabilistic algorithm linking data from an SCIMS center and its affiliated trauma registry.
Method: Data on SCI admissions from 2011 to 2018 were retrieved from an SCIMS center (n = 302) and its trauma registry (n = 723); 202 records shared a medical record number across the two sources. The SCIMS records were divided equally into two data sets, one for algorithm development and one for validation. We used a two-step approach: blocking, followed by weight generation for the linking variables (race, insurance, height, and weight).
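The two-step approach described above can be sketched as follows. This is a minimal illustration in the Fellegi-Sunter style, not the authors' implementation: the blocking keys (sex, age, injury year) are taken from the Results, while the field names, the m/u probabilities, the exact-agreement comparison, and the threshold are all assumptions made for the example.

```python
import math
from collections import defaultdict

# ASSUMPTION: illustrative m/u probabilities, not values from the paper.
# m = P(field agrees | true match), u = P(field agrees | non-match)
M_U = {
    "race": (0.95, 0.50),
    "insurance": (0.85, 0.30),
    "height": (0.80, 0.10),
    "weight": (0.75, 0.05),
}
BLOCK_KEYS = ("sex", "age", "injury_year")


def block(scims, trauma):
    """Yield candidate pairs that agree exactly on every blocking key."""
    index = defaultdict(list)
    for t in trauma:
        index[tuple(t[k] for k in BLOCK_KEYS)].append(t)
    for s in scims:
        for t in index.get(tuple(s[k] for k in BLOCK_KEYS), []):
            yield s, t


def pair_weight(s, t):
    """Sum log2 agreement or disagreement weights over the linking variables."""
    total = 0.0
    for field, (m, u) in M_U.items():
        if s[field] == t[field]:
            total += math.log2(m / u)
        else:
            total += math.log2((1 - m) / (1 - u))
    return total


def link(scims, trauma, threshold=0.0):
    """Keep, per SCIMS record, the highest-weight candidate at or above threshold."""
    best = {}
    for s, t in block(scims, trauma):
        w = pair_weight(s, t)
        if w >= threshold and (s["id"] not in best or w > best[s["id"]][1]):
            best[s["id"]] = (t["id"], w)
    return best


# Hypothetical records for demonstration only.
scims = [
    {"id": "S1", "sex": "M", "age": 34, "injury_year": 2015,
     "race": "White", "insurance": "Private", "height": 180, "weight": 82},
]
trauma = [
    {"id": "T1", "sex": "M", "age": 34, "injury_year": 2015,
     "race": "White", "insurance": "Private", "height": 180, "weight": 82},
    {"id": "T2", "sex": "M", "age": 34, "injury_year": 2015,
     "race": "Black", "insurance": "Medicaid", "height": 170, "weight": 95},
    {"id": "T3", "sex": "F", "age": 34, "injury_year": 2015,
     "race": "White", "insurance": "Private", "height": 180, "weight": 82},
]
links = link(scims, trauma)
```

Here T3 is excluded by blocking because its sex differs, and T2 falls below the threshold because it disagrees on all four linking variables. In practice, height and weight would likely be compared with a tolerance rather than exact equality, and the m/u probabilities would be estimated from the development set.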
Results: In the development set, 257 SCIMS-trauma pairs shared the same sex, age, and injury year across 129 clusters, of which 91 were true matches. The probabilistic algorithm identified 65 of the 91 true matches (sensitivity, 71.4%) with a positive predictive value (PPV) of 80.2%. The algorithm was validated over 282 SCIMS-trauma pairs across 127 clusters and had a sensitivity of 73.7% and a PPV of 81.1%. Post hoc analysis showed that adding injury date and zip code improved the specificity from 57.9% to 94.7%.
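The development-set figures can be checked with a short calculation. The true-positive and false-negative counts come directly from the abstract; the false-positive count of 16 is an assumption, inferred here because it is the value consistent with the reported PPV of 80.2% (65 of 81 predicted matches), and it is not stated in the abstract.

```python
def sensitivity(tp, fn):
    """Share of true matches that the algorithm identified."""
    return tp / (tp + fn)


def ppv(tp, fp):
    """Share of predicted matches that are true matches."""
    return tp / (tp + fp)


tp = 65        # true matches identified (from the abstract)
fn = 91 - 65   # true matches missed (from the abstract)
fp = 16        # ASSUMPTION: inferred from the reported PPV, not stated

print(f"sensitivity = {sensitivity(tp, fn):.1%}")  # sensitivity = 71.4%
print(f"PPV = {ppv(tp, fp):.1%}")                  # PPV = 80.2%
```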
Conclusion: We demonstrate the feasibility of probabilistic linkage between SCIMS and trauma records, which needs further refinement and validation. Gaining access to injury date and zip code would improve record linkage significantly.
Keywords: data linkage; databases; rehabilitation; spinal cord injuries; trauma.
© 2020 American Spinal Injury Association.
Conflict of interest statement
The authors declare no conflicts of interest.