Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Oct;26(10):10-20.

Linking 2006 Census and hospital data in Canada

Affiliations
  • PMID: 26488823
Free article

Linking 2006 Census and hospital data in Canada

Michelle Rotermann et al. Health Rep. 2015 Oct.
Free article

Abstract

Background: Record linkage is commonly used in health research to fill data gaps. This study summarizes the linkage of the 2006 Census of Population (excluding Quebec) to hospital data from the Discharge Abstract Database (DAD).

Data and methods: Hierarchical deterministic exact matching was employed to link 2006 Census and DAD (2006/2007, 2007/2008 and 2008/2009) data, based on linkage keys derived from three variables common to both files-date of birth, postal code and sex. The full census file (short-form; 23.4 million) was used for record linkage; the 20% file (long-form; 4.65 million) representing the study cohort was used for validation. Linked files were compared across jurisdictions, years and other selected covariates in terms of eligibility for linkage, keys linked, and linkage and coverage rates.

Results: Overall, 80% of linkage keys identified in the DAD were linked to the 2006 Census. The percentage of long-form census respondents linked to at least one hospital record ranged between 5% and 8% across jurisdictions; linkage rates were higher among known high users of hospital services: older age groups, lower-income individuals, and Aboriginal people. In general, the linked census file represents the majority of hospital events that occurred during the study period. Coverage rates (weighted/unweighted) varied by geography and age group, with lower weighted rates for the territories and some younger age groups.

Interpretation: With hierarchical deterministic exact matching, census data can be linked to multiple years of DAD data. Incorporation of updated postal codes from tax files reduced linkage rate attrition over time. Lower coverage rates for the territories and younger age groups suggest that these populations may be underrepresented in the linked files.

Keywords: Coverage; data collection; data linkage; databases; hospital records; medical records.

PubMed Disclaimer

LinkOut - more resources