Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Jun 24;14(1):152.
doi: 10.1186/s13023-019-1125-2.

De-duplicating patient records from three independent data sources reveals the incidence of rare neuromuscular disorders in Germany

Affiliations

De-duplicating patient records from three independent data sources reveals the incidence of rare neuromuscular disorders in Germany

Kirsten König et al. Orphanet J Rare Dis. .

Abstract

Background: Estimation of incidence in rare diseases is often challenging due to unspecific and incomplete coding and recording systems. Patient- and health care provider-driven data collections are held with different organizations behind firewalls to protect the privacy of patients. They tend to be fragmented, incomplete and their aggregation leads to further inaccuracies, as the duplicated records cannot easily be identified. We here report about a novel approach to evaluate the incidences of Duchenne muscular dystrophy (DMD) and spinal muscular atrophy (SMA) in Germany.

Methods: We performed a retrospective epidemiological study collecting data from patients with dystrophinopathies (DMD and Becker muscular dystrophy) and SMA born between 1995 and 2018. We invited all neuromuscular centers, genetic institutes and the patient registries for DMD and SMA in Germany to participate in the data collection. A novel web-based application for data entry was developed converting patient identifying information into a hash code. Duplicate entries were reliably allocated to the distinct patient.

Results: We collected 5409 data entries in our web-based database representing 1955 distinct patients with dystrophinopathies and 1287 patients with SMA. 55.0% of distinct patients were found in one of the 3 data sources only, while 32.0% were found in 2, and 13.0% in all 3 data sources. The highest number of SMA patients was reported by genetic testing laboratories, while for DMD the highest number was reported by the clinical specialist centers. After the removal of duplicate records, the highest yearly incidence for DMD was calculated as 2.57:10,000 in 2001 and the highest incidence for SMA as 1.36:10,000 in 2014.

Conclusion: With our novel approach (compliant with data protection regulations), we were able to identify unique patient records and estimate the incidence of DMD and SMA in Germany combining and de-duplicating data from patient registries, genetic institutes, and clinical care centers. Although we combined three different data sources, an unknown number of patients might not have been reported by any of these sources. Therefore, our results reflect the minimal incidence of these diseases.

Keywords: Duchenne muscular dystrophy; Incidence; Neuromuscular disease; Spinal muscular atrophy.

PubMed Disclaimer

Conflict of interest statement

JK received research funding and/or compensation for presentations and consultancy from Avexis, Biogen, Ionis Pharmaceuticals, Novartis, and Roche. HL received research funding and/or compensation for presentations and consultancy from AMO Pharma, Biogen, Desitin, Roche, Santhera, Sarepta, Satellos, Ultragenyx. AP received compensations for presentations and training activities from Biogen. DS participated in workshops sponsored by Biogen. MCW received research funding and/or compensation for presentations and consultancy from Avexis, Biogen, Grünenthal, Novartis, PTC, Roche, Santhera, and Sarepta.

Figures

Fig. 1
Fig. 1
Presentation of all data entries by neuromuscular centers, genetic institutes and the patient registry, the identification of distinct patients and the allocation to different subtypes of dystrophinopathy (a) or SMA (b)
Fig. 2
Fig. 2
Data entries of distinct patients with a dystrophinopathies or b SMA subdivided based on the different data sources. Data entered only by genetic institutes is displayed in light blue, data from neuromuscular centers in green and data from the patient registry in teal
Fig. 3
Fig. 3
Data stratified by year of birth and the data source. a patients with DMD; b patients with SMA

References

    1. Bladen CL, Rafferty K, Straub V, Monges S, Moresco A, Dawkins H, et al. The TREAT-NMD Duchenne muscular dystrophy registries: conception, design, and utilization by industry and academia. Hum Mutat. 2013;34:1449–1457. doi: 10.1002/humu.22390. - DOI - PubMed
    1. Baker D, Knoppers BM, Phillips M, van Enckevort D, Kaufmann P, Lochmuller H, Taruscio D. Privacy-preserving linkage of genomic and clinical data sets. IEEE/ACM Trans Comput Biol Bioinform. 2018. 10.1109/TCBB.2018.2855125. - PubMed
    1. Ebner H, Hayn D, Falgenhauer M, Nitzlnader M, Schleiermacher G, Haupt R, et al. Piloting the European unified patient identity management (EUPID) concept to facilitate secondary use of neuroblastoma data from Clinical Trials and Biobanking. Stud Health Technol Inform. 2016;223:31–38. - PubMed
    1. Thompson R, Robertson A, Lochmüller H. Natural history, trial readiness and gene discovery: advances in patient registries for neuromuscular disease. Adv Exp Med Biol. 2017;1031:97–124. doi: 10.1007/978-3-319-67144-4_5. - DOI - PubMed
    1. Rodger S, Lochmüller H, Tassoni A, Gramsch K, König K, Bushby K, et al. The TREAT-NMD care and trial site registry: an online registry to facilitate clinical research for neuromuscular diseases. Orphanet J Rare Dis. 2013;8:171. doi: 10.1186/1750-1172-8-171. - DOI - PMC - PubMed

Publication types