Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Nov;112(18):1450-1460.
doi: 10.1002/bdr2.1767. Epub 2020 Aug 19.

Exploratory analysis of machine learning approaches for surveillance of Zika-associated birth defects

Affiliations

Exploratory analysis of machine learning approaches for surveillance of Zika-associated birth defects

Richard Lusk et al. Birth Defects Res. 2020 Nov.

Abstract

In 2016, Centers for Disease Control and Prevention (CDC) established surveillance of pregnant women with Zika virus infection and their infants in the U.S. states, territories, and freely associated states. To identify cases of Zika-associated birth defects, subject matter experts review data reported from medical records of completed pregnancies to identify findings that meet surveillance case criteria (manual review). The volume of reported data increased over the course of the Zika virus outbreak in the Americas, challenging the resources of the surveillance system to conduct manual review. Machine learning was explored as a possible method for predicting case status. Ensemble models (using machine learning algorithms including support vector machines, logistic regression, random forests, k-nearest neighbors, gradient boosted trees, and decision trees) were developed and trained using data collected from January 2016-October 2017. Models were developed separately, on data from the U.S. states, non-Puerto Rico territories, and freely associated states (referred to as the U.S. Zika Pregnancy and Infant Registry [USZPIR]) and data from Puerto Rico (referred to as the Zika Active Pregnancy Surveillance System [ZAPSS]) due to differences in data collection and storage methods. The machine learning models demonstrated high sensitivity for identifying cases while potentially reducing volume of data for manual review (USZPIR: 96% sensitivity, 25% reduction in review volume; ZAPSS: 97% sensitivity, 50% reduction in review volume). Machine learning models show potential for identifying cases of Zika-associated birth defects and for reducing volume of data for manual review, a potential benefit in other public health emergency response settings.

Keywords: Zika virus; birth defects; machine learning; surveillance.

PubMed Disclaimer

Conflict of interest statement

CONFLICT OF INTEREST

The authors declare no potential conflict of interest.

Figures

FIGURE 1
FIGURE 1
Flow chart of methods approach to exploring ML application in the Zika case review process

References

    1. Hastie T, Tibshirani R, Friedman J, & Franklin J (2005). The elements of statistical learning: Data mining, inference and prediction. Mathematical Intelligencer, 27(2), 83–85.
    1. Honein MA, Dawson AL, Petersen EE, Jones AM, Lee EH, Yazdy MM, … Jamieson DJ (2017). Birth defects among fetuses and infants of US women with evidence of possible Zika virus infection during pregnancy. JAMA, 317(1), 59–68. 10.1001/jama.2016.19006 - DOI - PubMed
    1. Kang J, Schwartz R, Flickinger J, & Beriwal S (2015). Machine learning approaches for predicting radiation therapy outcomes: A clinician's perspective. International Journal of Radiation Oncology, Biology, Physics, 93(5), 1127–1135. 10.1016/j.ijrobp.2015.07.2286 - DOI - PubMed
    1. Lee Y, Ragguett RM, Mansur RB, Boutilier JJ, Rosenblat JD, Trevizol A, … McIntyre RS (2018). Applications of machine learning algorithms to predict therapeutic outcomes in depression: A meta-analysis and systematic review. Journal of Affective Disorders, 2, e100. 10.1038/tp.2012.10 - DOI - PubMed
    1. Li C, Lu Y, Wu J, Zhang Y, Xia Z, Wang T, … Guo J (2018). LDA Meets Word2Vec: A novel model for academic abstract clustering. In Companion proceedings of the the web conference 2018 (pp. 1699–1706). Lyon, France: International World Wide Web Conferences Steering Committee.

Publication types