Multiple Imputation of Race and Hispanic Ethnicity in National Surveillance Data for Chlamydia, Gonorrhea, and Syphilis
- PMID: 38885519
- PMCID: PMC11560705
- DOI: 10.1097/OLQ.0000000000002047
Multiple Imputation of Race and Hispanic Ethnicity in National Surveillance Data for Chlamydia, Gonorrhea, and Syphilis
Abstract
Background: Disease burden of sexually transmitted infections such as chlamydia, gonorrhea, and syphilis is often compared across age categories, sex categories, and race and ethnicity categories. Missing data may prevent researchers from accurately characterizing health disparities between populations. This article describes the methods used to impute race and Hispanic ethnicity in a large national surveillance data set.
Methods: All US cases of chlamydia, gonorrhea, and syphilis (excluding congenital syphilis) reported through the National Notifiable Diseases Surveillance System from the year 2019 were included in the analyses. We used fully conditional specification to impute missing race and Hispanic ethnicity data. After imputation, reported case rates were calculated, by disease, for each race and Hispanic ethnicity category using Vintage 2019 Population and Housing Unit Estimates from the US Census. We then used case counts from subsets that contained only complete race and Hispanic ethnicity information to investigate if the confidence intervals from the multiply imputed data included the observed number of cases in each race and Hispanic ethnicity category.
Results: Among the 2,553,038 cases reported in 2019, race and Hispanic ethnicity were multiply imputed for 9% of syphilis cases, 22% of gonorrhea cases, and 33% of chlamydia cases. In the subset analyses, every nonzero rate of reported cases was contained within the confidence intervals that were calculated from multiply imputed data.
Conclusions: Confidence intervals that account for the uncertainty of the predictions are an advantage of multiple imputation over complete-case analysis because a realistic variance estimate allows for valid hypothesis testing results.
Copyright © 2024 American Sexually Transmitted Diseases Association. All rights reserved.
Conflict of interest statement
Conflict of interest and sources of funding: None declared.
Figures




References
-
- Centers for Disease Control and Prevention. National Notifiable Diseases Surveillance System (NNDSS). Accessed April 25, 2024. https://www.cdc.gov/nndss/index.html
-
- Department of Health and Human Services. STI National Strategic Plan Overview. Accessed April 25, 2024. https://www.hhs.gov/programs/topic-sites/sexually-transmitted-infections...
-
- Office of Disease Prevention and Health Promotion. Healthy People 2030 Framework. Accessed April 25, 2024. https://health.gov/healthypeople/about/healthy-people-2030-framework
-
- Centers for Disease Control and Prevention. Sexually Transmitted Disease Surveillance 2019. Accessed April 25, 2024. https://www.cdc.gov/std/statistics/2019/std-surveillance-2019.pdf
-
- van Buuren S. Flexible Imputation of Missing Data. CRC Press; 2012.
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Medical