Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Mar 15;40(1):125-136.
doi: 10.3233/sji-230125.

Evaluating data quality for blended data using a data quality framework

Affiliations

Evaluating data quality for blended data using a data quality framework

Jennifer D Parker et al. Stat J IAOS. .

Abstract

In 2020 the U.S. Federal Committee on Statistical Methodology (FCSM) released "A Framework for Data Quality", organized by 11 dimensions of data quality grouped among three domains of quality (utility, objectivity, integrity). This paper addresses the use of the FCSM Framework for data quality assessments of blended data. The FCSM Framework applies to all types of data, however best practices for implementation have not been documented. We applied the FCSM Framework for three health-research related case studies. For each case study, assessments of data quality dimensions were performed to identify threats to quality, possible mitigations of those threats, and trade-offs among them. From these assessments the authors concluded: 1) data quality assessments are more complex in practice than anticipated and expert guidance and documentation are important; 2) each dimension may not be equally important for different data uses; 3) data quality assessments can be subjective and having a quantitative tool could help explain the results, however, quantitative assessments may be closely tied to the intended use of the dataset; 4) there are common trade-offs and mitigations for some threats to quality among dimensions. This paper is one of the first to apply the FCSM Framework to specific use-cases and illustrates a process for similar data uses.

Keywords: Administrative Data; Blended Data; Data Linkage; Data Quality; Health Surveys.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
FCSM Framework for Data Quality Source: FCSM 2020, FCSM-20-04 A Framework for Data Quality

References

    1. Federal Committee on Statistical Methodology. 2020. A Framework for Data Quality. FCSM 20-04. September 2020. Available from https://www.fcsm.gov/assets/files/docs/FCSM.20.04_A_Framework_for_Data_Q... [Accessed 6 December 2023].
    1. National Center for Health Statistics. NCHS Data Linkage. NCHS Data Linked to US Department of Housing and Urban Development (HUD) Housing Assistance Data. NCHS Data Linkage - HUD Administrative Data [homepage on the internet] NCHS; 2023. Available from https://www.cdc.gov/nchs/data-linkage/hud.htm. [Accessed 6 December 2023].
    1. U.S. Environmental Protection Agency (USEPA). Air Data: Air Quality Data Collected at Outdoor Monitors Across the US. [homepage on the internet]. USEPA; 2023. Available from https://www.epa.gov/outdoor-air-quality-data. [Accessed 6 December 2023]
    1. U.S. Environmental Protection Agency (USEPA). CMAQ: The Community Multiscale Air Quality Modeling System. [homepage on the internet]. USEPA; 2023. Available from https://www.epa.gov/cmaq. [reviewed 2023 November 30; cited 2023 December 6]
    1. U.S. Environmental Protection Agency (USEPA). Remote Sensing Information Gateway (RSIG)-Related Downloadable Data Files. [homepage on the internet]. USEPA; 2023. Available from https://www.epa.gov/hesc/rsig-related-downloadable-data-files. [Accessed 6 December 2023]

LinkOut - more resources