Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Sep 9;47(9):zsae115.
doi: 10.1093/sleep/zsae115.

Pioneering a multi-phase framework to harmonize self-reported sleep data across cohorts

Affiliations

Pioneering a multi-phase framework to harmonize self-reported sleep data across cohorts

Meredith L Wallace et al. Sleep. .

Abstract

Study objectives: Harmonizing and aggregating data across studies enables pooled analyses that support external validation and enhance replicability and generalizability. However, the multidimensional nature of sleep poses challenges for data harmonization and aggregation. Here we describe and implement our process for harmonizing self-reported sleep data.

Methods: We established a multi-phase framework to harmonize self-reported sleep data: (1) compile items, (2) group items into domains, (3) harmonize items, and (4) evaluate harmonizability. We applied this process to produce a pooled multi-cohort sample of five US cohorts plus a separate yet fully harmonized sample from Rotterdam, Netherlands. Sleep and sociodemographic data are described and compared to demonstrate the utility of harmonization and aggregation.

Results: We collected 190 unique self-reported sleep items and grouped them into 15 conceptual domains. Using these domains as guiderails, we developed 14 harmonized items measuring aspects of satisfaction, alertness/sleepiness, timing, efficiency, duration, insomnia, and sleep apnea. External raters determined that 13 of these 14 items had moderate-to-high harmonizability. Alertness/Sleepiness items had lower harmonizability, while continuous, quantitative items (e.g. timing, total sleep time, and efficiency) had higher harmonizability. Descriptive statistics identified features that are more consistent (e.g. wake-up time and duration) and more heterogeneous (e.g. time in bed and bedtime) across samples.

Conclusions: Our process can guide researchers and cohort stewards toward effective sleep harmonization and provide a foundation for further methodological development in this expanding field. Broader national and international initiatives promoting common data elements across cohorts are needed to enhance future harmonization and aggregation efforts.

Keywords: data harmonization; data sharing; data standardization; sleep disorder; sleep health; sleepiness.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Flow chart summarizing harmonization steps.
Figure 2.
Figure 2.
Initial rater agreement for N = 190 self-report sleep items, summarized based on the final agreed-upon domain.
Figure 3.
Figure 3.
Mean and standard deviation of the item H-scores (rows 2–15, column 1 [“All Cohorts”]), the cohort-item H-scores (rows 2–15, columns 2–7), the cohort H-scores (row 1 [“All Items”], columns 2–7), and the overall H-score (row 1, column 1). Gray squares with no text indicate the question was not available in that cohort.
Figure 4.
Figure 4.
Harmonization scores for: (A) daytime dysfunction due to sleepiness; and (B) total sleep time.
Figure 5.
Figure 5.
Mean and standard deviation of harmonization scores (H-scores) for cohort pairs, across all available items.

Comment in

Similar articles

Cited by

References

    1. Bernard C. Stop reproducing the reproducibility crisis. eNeuro. 2023;10(2):ENEURO.0430–ENEU23.2023. doi: 10.1523/ENEURO.0430-23.2023 - DOI - PMC - PubMed
    1. Tackett JL, Brandes CM, King KM, Markon KE.. Psychology’s replication crisis and clinical psychological science. Annu Rev Clin Psychol. 2019;15:579–604. doi: 10.1146/annurev-clinpsy-050718-095710 - DOI - PubMed
    1. Stupple A, Singerman D, Celi LA.. The reproducibility crisis in the age of digital medicine. NPJ Digit Med. 2019;2:2. doi: 10.1038/s41746-019-0079-z - DOI - PMC - PubMed
    1. Wasserstein RL, Lazar NA.. The ASA’s statement on p-values: context, process, and purpose. Am Stat. 2016;70(2):129–133. doi: 10.1080/00031305.2016.1154108 - DOI
    1. Zhang GQ, Cui L, Mueller R, et al.. The National Sleep Research Resource: towards a sleep data commons. J Am Med Inform Assoc. 2018;25(10):1351–1358. doi: 10.1093/jamia/ocy064 - DOI - PMC - PubMed

Publication types

Grants and funding