Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 May:3:31.
doi: 10.3389/frai.2020.00031. Epub 2020 May 15.

Bringing Big Data to Bear in Environmental Public Health: Challenges and Recommendations

Affiliations

Bringing Big Data to Bear in Environmental Public Health: Challenges and Recommendations

Saskia Comess et al. Front Artif Intell. 2020 May.

Abstract

Understanding the role that the environment plays in influencing public health often involves collecting and studying large, complex data sets. There have been a number of private and public efforts to gather sufficient information and confront significant unknowns in the field of environmental public health, yet there is a persistent and largely unmet need for findable, accessible, interoperable, and reusable (FAIR) data. Even when data are readily available, the ability to create, analyze, and draw conclusions from these data using emerging computational tools, such as augmented and artificial inteligence (AI) and machine learning, requires technical skills not currently implemented on a programmatic level across research hubs and academic institutions. We argue that collaborative efforts in data curation and storage, scientific computing, and training are of paramount importance to empower researchers within environmental sciences and the broader public health community to apply AI approaches and fully realize their potential. Leaders in the field were asked to prioritize challenges in incorporating big data in environmental public health research: inconsistent implementation of FAIR principles in data collection and sharing, a lack of skilled data scientists and appropriate cyber-infrastructures, and limited understanding of possibilities and communication of benefits were among those identified. These issues are discussed, and actionable recommendations are provided.

Keywords: artificial intelligence; big data; environmental health sciences; machine learning; open data; public health.

PubMed Disclaimer

Conflict of interest statement

Conflict of Interest: LJ is employed by Microsoft Corporation. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Similar articles

Cited by

References

    1. Austin C. P., Colvis C. M., Southall N. T. (2019). Deconstructing the translational tower of babel. Clin. Translat. Sci. 12:85. 10.1111/cts.12595 - DOI - PMC - PubMed
    1. Bai L., Wang J., Ma X., Lu H. (2018). Air pollution forecasts: an overview. Int. J. Environ. Res. Public Health 15:780. 10.3390/ijerph15040780 - DOI - PMC - PubMed
    1. Bell S. M., Sprankle C., Morefield S. Q., Allen D., Phillips J., Sedykh A., et al. . (2017). An integrated chemical environment to support 21st-century toxicology. Environmental Health Perspectives 125, 1–4. 10.1289/EHP1759 - DOI - PMC - PubMed
    1. Bellinger C., Jabbar M., Zaïane O., Osornio-Vargas A. (2017). A systematic review of data mining and machine learning for air pollution epidemiology. BMC Public Health 17:907. 10.1186/s12889-017-4914-3 - DOI - PMC - PubMed
    1. Bobb J. F., Valeri L., Henn B. C., Christiani D. C., Wright R. O., Mazumdar M., et al. . (2014). Bayesian kernel machine regression for estimating the health effects of multi-pollutant mixtures. Biostatistics 16, 493–508. 10.1093/biostatistics/kxu058 - DOI - PMC - PubMed

LinkOut - more resources