Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020;26(4):221-231.
doi: 10.46292/sci20-00015. Epub 2021 Jan 20.

Linking Individual Data From the Spinal Cord Injury Model Systems Center and Local Trauma Registry: Development and Validation of Probabilistic Matching Algorithm

Affiliations

Linking Individual Data From the Spinal Cord Injury Model Systems Center and Local Trauma Registry: Development and Validation of Probabilistic Matching Algorithm

Yuying Chen et al. Top Spinal Cord Inj Rehabil. 2020.

Abstract

Background: Linking records from the National Spinal Cord Injury Model Systems (SCIMS) database to the National Trauma Data Bank (NTDB) provides a unique opportunity to study early variables in predicting long-term outcomes after traumatic spinal cord injury (SCI). The public use data sets of SCIMS and NTDB are stripped of protected health information, including dates and zip code.

Objectives: To develop and validate a probabilistic algorithm linking data from an SCIMS center and its affiliated trauma registry.

Method: Data on SCI admissions 2011-2018 were retrieved from an SCIMS center (n = 302) and trauma registry (n = 723), of which 202 records had the same medical record number. The SCIMS records were divided equally into two data sets for algorithm development and validation, respectively. We used a two-step approach: blocking and weight generation for linking variables (race, insurance, height, and weight).

Results: In the development set, 257 SCIMS-trauma pairs shared the same sex, age, and injury year across 129 clusters, of which 91 records were true-match. The probabilistic algorithm identified 65 of the 91 true-match records (sensitivity, 71.4%) with a positive predictive value (PPV) of 80.2%. The algorithm was validated over 282 SCIMS-trauma pairs across 127 clusters and had a sensitivity of 73.7% and PPV of 81.1%. Post hoc analysis shows the addition of injury date and zip code improved the specificity from 57.9% to 94.7%.

Conclusion: We demonstrate the feasibility of probabilistic linkage between SCIMS and trauma records, which needs further refinement and validation. Gaining access to injury date and zip code would improve record linkage significantly.

Keywords: data linkage; databases; rehabilitation; spinal cord injuries; trauma.

PubMed Disclaimer

Conflict of interest statement

Conflicts of Interest The authors declare no conflicts of interest.

Figures

Figure 1.
Figure 1.
Frequency distribution of total weights (Wtotal) among 257 SCIMS-trauma pairs in the training set: the highest weight within each cluster (black bar) versus all other weights (white bar). Grey bar represents the overlapping of the highest and all other weights. (A) Probabilistic linkage with race, insurance, height, and body weight. (B) Probabilistic linkage with race, insurance, height, body weight, injury date, and residential zip code.
Figure 2.
Figure 2.
Accuracy of record linkage in the training set presented by the receiver operating characteristic curve. The dotted line represents the probabilistic linkage with race, insurance, height, and body weight. The solid line represents the probabilistic linkage with race, insurance, height, body weight, injury date, and residential zip code. Each black dot indicates a different cutoff value of total weight (Wtotal).

Similar articles

Cited by

References

    1. Chen Y, DeVivo MJ, Richards JS, SanAgustin TB. Spinal Cord Injury Model Systems: Review of program and national database from 1970 to 2015. Arch Phys Med Rehabil. 2016;97(10):1797–1804. - PubMed
    1. DeVivo MJ, Jackson AB, Dijkers MP, Becker BE. Current research outcomes from the Model Spinal Cord Injury Care Systems. Arch Phys Med Rehabil. 1999;80:1363–1364. - PubMed
    1. Lammertse DP, Jackson AB, Sipski ML. Research from the Model Spinal Cord Injury Systems: Findings from the current 5-year grant cycle. Arch Phys Med Rehabil. 2004;85(11):1737–1739. - PubMed
    1. Chen Y, Deutsch A, DeVivo MJ et al. Current research outcomes from the spinal cord injury model systems. Arch Phys Med Rehabil. 2011;92(3):329–331. - PubMed
    1. Nemunaitis G, Roach MJ, Claridge J, Mejia M. Early predictors of functional outcome after trauma. PM R. 2016;8(4):314–320. - PubMed

Publication types

LinkOut - more resources