Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2017 Apr;29(2):231-239.
doi: 10.1097/MOP.0000000000000467.

Big and disparate data: considerations for pediatric consortia

Affiliations
Review

Big and disparate data: considerations for pediatric consortia

Jeanette A Stingone et al. Curr Opin Pediatr. 2017 Apr.

Abstract

Purpose of review: Increasingly, there is a need for examining exposure disease associations in large, diverse datasets to understand the complex determinants of pediatric disease and disability. Recognizing that children's health research consortia will be important sources of big data, it is crucial for the pediatric research community to be knowledgeable about the challenges and opportunities that they will face. The present review will provide examples of existing children's health consortia, highlight recent pooled analyses conducted by children's health research consortia, address common challenges of pooled analyses, and provide recommendations to advance collective research efforts in pediatric research.

Recent findings: Formal consortia and other collective-science initiatives are increasingly being created to share individual data from a set of relevant epidemiological studies to address a common research topic under the concept that the joint effort of many individual groups can accomplish far more than working alone. There are practical challenges to the participation of investigators within consortia that need to be addressed in order for them to work.

Summary: Researchers who access consortia with data centers will be able to go far beyond their initial hypotheses and potentially accomplish research that was previously thought infeasible or too costly.

PubMed Disclaimer

Conflict of interest statement

Conflicts of interest: None declared.

Figures

Figure 1
Figure 1
Construction of Harmonized Variables Across Disparate Studies using Standardized Terms within an Ontology. Using maternal education as an illustrative example, variable harmonization starts by detailing the response levels of the variable across disparate studies and then mapping to common terms that encompass all of the response levels. These terms then form the response-levels of the new, harmonized variable that can be used in pooled analyses.

References

    1. Thompson A. Thinking big: large-scale collaborative research in observational epidemiology. Eur J Epidemiol. 2009;24(12):727–31. - PubMed
    1. National Institutes of Health. What is Big Data? Data Science at NIH. 2016 [Available from: https://datascience.nih.gov/bd2k/about/what]
    1. Yoo C, Ramirez L, Liuzzi J. Big data analysis using modern statistical and machine learning methods in medicine. Int NeurourolJ. 2014;18(2):50–7. - PMC - PubMed
    1. Fan J, Han F, Liu H. Challenges of Big Data Analysis. NatlSci Rev. 2014;1(2):293–314. - PMC - PubMed
    1. National Institute of Environmental Health Sciences. Data Repository, Analysis, and Science Center. CHEAR Program. 2016 [Available from: https://chearprogram.org/about/datarep.] This comprehensive environmental exposure resource is a groundbreaking effort to increase the use of environmental biomarkers in children’s health research.