Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Aug 15;176(4):347-56.
doi: 10.1093/aje/kwr512. Epub 2012 Jul 25.

Use of imputed population-based cancer registry data as a method of accounting for missing information: application to estrogen receptor status for breast cancer

Affiliations

Use of imputed population-based cancer registry data as a method of accounting for missing information: application to estrogen receptor status for breast cancer

Nadia Howlader et al. Am J Epidemiol. .

Abstract

The National Cancer Institute's Surveillance, Epidemiology, and End Results (SEER) Program provides a rich source of data stratified according to tumor biomarkers that play an important role in cancer surveillance research. These data are useful for analyzing trends in cancer incidence and survival. These tumor markers, however, are often prone to missing observations. To address the problem of missing data, the authors employed sequential regression multivariate imputation for breast cancer variables, with a particular focus on estrogen receptor status, using data from 13 SEER registries covering the period 1992-2007. In this paper, they present an approach to accounting for missing information through the creation of imputed data sets that can be analyzed using existing software (e.g., SEER*Stat) developed for analyzing cancer registry data. Bias in age-adjusted trends in female breast cancer incidence is shown graphically before and after imputation of estrogen receptor status, stratified by age and race. The imputed data set will be made available in SEER*Stat (http://seer.cancer.gov/analysis/index.html) to facilitate accurate estimation of breast cancer incidence trends. To ensure that the imputed data set is used correctly, the authors provide detailed, step-by-step instructions for conducting analyses. This is the first time that a nationally representative, population-based cancer registry data set has been imputed and made available to researchers for conducting a variety of analyses of breast cancer incidence trends.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Distribution of estrogen receptor status over time for female patients with malignant breast cancer in 13 Surveillance, Epidemiology, and End Results registries, 1992–2007.
Figure 2.
Figure 2.
Breast cancer incidence trends for US white women according to estrogen receptor (ER) status (observed vs. imputed), 1992–2007. A) All age groups; B) ages 40–49 years; C) ages 50–59 years; D) ages 60–69 years; E) ages ≥70 years. Blue dot, ER-positive observed rate; red dot, ER-positive imputed rate; blue triangle, ER-negative observed rate; red triangle, ER-negative imputed rate. The blue line denotes the rate modeled by the Joinpoint Regression Program for the ER observed trend; the red line denotes the Joinpoint-modeled rate for the ER imputed trend.
Figure 3.
Figure 3.
Breast cancer incidence trends for US black women according to estrogen receptor (ER) status (observed vs. imputed), 1992–2007. A) All age groups; B) ages 40–49 years; C) ages 50–59 years; D) ages 60–69 years; E) ages ≥70 years. Blue dot, ER-positive observed rate; red dot, ER-positive imputed rate; blue triangle, ER-negative observed rate; red triangle, ER-negative imputed rate. The blue line denotes the rate modeled by the Joinpoint Regression Program for the ER observed trend; the red line denotes the Joinpoint-modeled rate for the ER imputed trend.

References

    1. Anderson WF, Katki HA, Rosenberg PS. Incidence of breast cancer in the United States: current and future trends. J Natl Cancer Inst. 2011;103(18):1397–1402. ( doi:10.1093/jnci/djr257) - DOI - PMC - PubMed
    1. Ravdin PM, Cronin KA, Howlader N, et al. The decrease in breast-cancer incidence in 2003 in the United States. N Engl J Med. 2007;356(16):1670–1674. - PubMed
    1. Fritz A, Ries L. SEER Program Code Manual. 3rd. Bethesda, MD: National Cancer Institute; 1998. (http://seer.cancer.gov/manuals/codeman.pdf. ). (Accessed February 2, 2011)
    1. Little RJA, Rubin DB. Statistical Analysis With Missing Data. 2nd. New York, NY: John Wiley & Sons, Inc; 2002.
    1. Allison PD. Missing Data. Thousand Oaks, CA: Sage Publications; 2001.

Publication types