Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2016 May;23(3):627-34.
doi: 10.1093/jamia/ocv156. Epub 2015 Dec 11.

Assessing race and ethnicity data quality across cancer registries and EMRs in two hospitals

Affiliations
Comparative Study

Assessing race and ethnicity data quality across cancer registries and EMRs in two hospitals

Simon J Craddock Lee et al. J Am Med Inform Assoc. 2016 May.

Abstract

Background: Measurement of patient race/ethnicity in electronic health records is mandated and important for tracking health disparities.

Objective: Characterize the quality of race/ethnicity data collection efforts.

Methods: For all cancer patients diagnosed (2007-2010) at two hospitals, we extracted demographic data from five sources: 1) a university hospital cancer registry, 2) a university electronic medical record (EMR), 3) a community hospital cancer registry, 4) a community EMR, and 5) a joint clinical research registry. The patients whose data we examined (N = 17 834) contributed 41 025 entries (range: 2-5 per patient across sources), and the source comparisons generated 1-10 unique pairs per patient. We used generalized estimating equations, chi-squares tests, and kappas estimates to assess data availability and agreement.

Results: Compared to sex and insurance status, race/ethnicity information was significantly less likely to be available (χ(2 )> 8043, P < .001), with variation across sources (χ(2 )> 10 589, P < .001). The university EMR had a high prevalence of "Unknown" values. Aggregate kappa estimates across the sources was 0.45 (95% confidence interval, 0.45-0.45; N = 31 276 unique pairs), but improved in sensitivity analyses that excluded the university EMR source (κ = 0.89). Race/ethnicity data were in complete agreement for only 6988 patients (39.2%). Pairs with a "Black" data value in one of the sources had the highest agreement (95.3%), whereas pairs with an "Other" value exhibited the lowest agreement across sources (11.1%).

Discussion: Our findings suggest that high-quality race/ethnicity data are attainable. Many of the "errors" in race/ethnicity data are caused by missing or "Unknown" data values.

Conclusions: To facilitate transparent reporting of healthcare delivery outcomes by race/ethnicity, healthcare systems need to monitor and enforce race/ethnicity data collection standards.

Keywords: cancer registry; data quality; electronic medical record; race and ethnicity.

PubMed Disclaimer

Figures

Figure 1:
Figure 1:
Proportion of patients with availability of race/ethnicity, sex, and health insurance variables across the five data sources. Panel 1: Race/Ethnicity. Panel 2: Sex. Panel 3: Health Insurance. Each vertical bar represents point estimates for a specific source, with 95% confidence intervals denoted by the error bars. Source 5 did not record insurance information. *** Denotes that data availability for a given source is significantly different from the other sources (P < .001).
Figure 2:
Figure 2:
Disagreement analyses stratified by race-ethnicity category, excluding pairs containing an “Unknown” value. 95% CI values denote error bars.

References

    1. Thorlby R, Jorgensen S, Siegel B, Ayanian JZ. How health care organizations are using data on patients' race and ethnicity to improve quality of care. Milbank Q. 2011;89(2):226–255. - PMC - PubMed
    1. Mays VM, Ponce NA, Washington DL, Cochran SD. Classification of race and ethnicity: Implications for public health. Annu Rev Public Health. 2003;24(1):83–110. - PMC - PubMed
    1. Hahn RA. The state of federal health statistics on racial and ethnic groups. JAMA. 1992;267(2):268–271. - PubMed
    1. Manly JJ. Deconstructing race and ethnicity - Implications for measurement of health outcomes. Medical Care. 2006;44(11):S10–S16. - PubMed
    1. Ford ME, Kelly PA. Conceptualizing and categorizing race and ethnicity in health services research. Health Serv Res. 2005;40(5 Pt 2):1658–1675. - PMC - PubMed

Publication types