Review
Ann Epidemiol. 2023 Oct:86:80-89.e2.
doi: 10.1016/j.annepidem.2023.07.006. Epub 2023 Jul 20.

Improving data capture of race and ethnicity for the Food and Drug Administration Sentinel database: a narrative review

Monica Ter-Minassian et al. Ann Epidemiol. 2023 Oct.

Abstract

Purpose: The U.S. Food and Drug Administration's Sentinel System is a national medical product safety surveillance system consisting of a large multisite distributed database of administrative claims supplemented by electronic health-care record data. The program seeks to improve data capture of race and ethnicity for pharmacoepidemiology studies.

Methods: We conducted a narrative literature review of published research on data augmentation and imputation methods to improve race and ethnicity capture in U.S. health-care systems databases. We focused on methods with limited (five-digit ZIP codes only) or full patient identifiers available to link to external sources of self-reported data. We organized the literature by themes: (1) variation in data capture of self-reported data, (2) data augmentation from external sources of self-reported data, and (3) imputation methods, including Bayesian analysis and multiple regression.

Results: Researchers reduced data missingness with high validity for Asian, Black, White, and Pacific Islander racial groups and Hispanic ethnicity. Native American and multiracial groups were difficult to validate due to relatively small sample sizes.

Conclusions: Limitations on accessible self-reported data for validation will dictate methods to improve race and ethnicity data capture. We recommend methods leveraging multiple sources that account for variations in geography, age, and sex.

Keywords: American Indian or Alaska Native; Ethnicity; Geocoding; Geographic mapping; Hispanic or Latino; Imputation; Missing data.

Conflict of interest statement

Declaration of Competing Interest: The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
