Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jul 13;21(1):70.
doi: 10.1186/s12969-023-00839-2.

The successes and challenges of harmonising juvenile idiopathic arthritis (JIA) datasets to create a large-scale JIA data resource

Collaborators, Affiliations

The successes and challenges of harmonising juvenile idiopathic arthritis (JIA) datasets to create a large-scale JIA data resource

Saskia Lawson-Tovey et al. Pediatr Rheumatol Online J. .

Abstract

Background: CLUSTER is a UK consortium focussed on precision medicine research in JIA/JIA-Uveitis. As part of this programme, a large-scale JIA data resource was created by harmonizing and pooling existing real-world studies. Here we present challenges and progress towards creation of this unique large JIA dataset.

Methods: Four real-world studies contributed data; two clinical datasets of JIA patients starting first-line methotrexate (MTX) or tumour necrosis factor inhibitors (TNFi) were created. Variables were selected based on a previously developed core dataset, and encrypted NHS numbers were used to identify children contributing similar data across multiple studies.

Results: Of 7013 records (from 5435 individuals), 2882 (1304 individuals) represented the same child across studies. The final datasets contain 2899 (MTX) and 2401 (TNFi) unique patients; 1018 are in both datasets. Missingness ranged from 10 to 60% and was not improved through harmonisation.

Conclusions: Combining data across studies has achieved dataset sizes rarely seen in JIA, invaluable to progressing research. Losing variable specificity and missingness, and their impact on future analyses requires further consideration.

Keywords: Children and young people; Data harmonisation; JIA.

PubMed Disclaimer

Conflict of interest statement

SLT, SS, NG, SSW, SN have nothing to disclose. MRB reports grant income from Abbvie unrelated to this work. LRW reports non-personal consulting fees from Pfizer unrelated to this work; LRW is supported by the NIHR Great Ormond Street Biomedical Research Centre. KLH reports she has received non-personal speaker’s fees from Abbvie and grant income from BMS and Pfizer, all unrelated to this manuscript; KLH is supported by the NIHR Manchester Biomedical Research Centre.

Figures

Fig. 1
Fig. 1
Distribution of individuals with more than one record at any timepoint (“duplicates”) across CLUSTER studies
Fig. 2
Fig. 2
Percentage of missingness across key variables in the CLUSTER MTX and TNF datasets MTX Methotrexate, TNF Tumour necrosis factor inhibitors, T1 Timepoint 1 (closest values to baseline; allowed -3 months to drug start), T2 Timepoint 2 (closest value to 6 months after drug start; allowed 3–12 months after drug start), JIA Juvenile idiopathic arthritis, RF Rheumatoid factor, HLA B27 Human leukocyte antigen B27, ANA Anti-nuclear antibody, CHAQ Childhood health assessment questionnaire, ESR Erythrocyte sedimentation rate, CRP C-reactive protein, VAS Visual analogue scale

References

    1. Minden K. Juvenile Idiopathic Arthritis in Adolescence and Young Adulthood. In: Adolescent and Young Adult Rheumatology in Clinical Practice. Springer, Cham; 2019. p. 85–105. Available from: https://link.springer.com/chapter/10.1007/978-3-319-95519-3_7
    1. Doiron D, Burton P, Marcon Y, Gaye A, Wolffenbuttel BHR, Perola M, et al. Data harmonization and federated analysis of population-based studies: The BioSHaRE project. Emerg Themes Epidemiol. 2013 Nov 21 [cited 2021 Mar 12];10:12. Available from: https://ete-online.biomedcentral.com/articles/10.1186/1742-7622-10-12 - PMC - PubMed
    1. CLUSTER Consortium. [cited 2021 Mar 4]. Available from: https://www.clusterconsortium.org.uk/
    1. Doan A, Halevy A, Ives Z. 1 - Introduction. In: Doan A, Halevy A, Ives Z, editors. Principles of Data Integration. Morgan Kaufmann; 2012. p. 1–18. 10.1016/B978-0-12-416044-6.00001-6. https://www.sciencedirect.com/science/article/pii/B9780124160446000016.
    1. United Nations Statistical Office. Volume 1: Legal, organizational and technical aspects. In: Handbook of Vital Statistics Systems and Methods. United Nations. 1991. ISBN: 9211613280.

Substances