Use of imputed population-based cancer registry data as a method of accounting for missing information: application to estrogen receptor status for breast cancer
- PMID: 22842721
- PMCID: PMC3491971
- DOI: 10.1093/aje/kwr512
Abstract
The National Cancer Institute's Surveillance, Epidemiology, and End Results (SEER) Program provides a rich source of data stratified according to tumor biomarkers that play an important role in cancer surveillance research. These data are useful for analyzing trends in cancer incidence and survival. These tumor markers, however, are often prone to missing observations. To address the problem of missing data, the authors employed sequential regression multivariate imputation for breast cancer variables, with a particular focus on estrogen receptor status, using data from 13 SEER registries covering the period 1992-2007. In this paper, they present an approach to accounting for missing information through the creation of imputed data sets that can be analyzed using existing software (e.g., SEER*Stat) developed for analyzing cancer registry data. Bias in age-adjusted trends in female breast cancer incidence is shown graphically before and after imputation of estrogen receptor status, stratified by age and race. The imputed data set will be made available in SEER*Stat (http://seer.cancer.gov/analysis/index.html) to facilitate accurate estimation of breast cancer incidence trends. To ensure that the imputed data set is used correctly, the authors provide detailed, step-by-step instructions for conducting analyses. This is the first time that a nationally representative, population-based cancer registry data set has been imputed and made available to researchers for conducting a variety of analyses of breast cancer incidence trends.
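As an illustration of the general idea behind sequential regression imputation, the following is a minimal sketch, not the authors' implementation. It assumes synthetic data, binary variables only, and a hypothetical second marker (`marker2`) alongside estrogen receptor status; the function names `fit_logistic` and `sequential_impute` are invented for this example. Each incomplete variable is regressed in turn on all the others, and its missing entries are redrawn from the fitted predictive probabilities. Unlike a full multiple-imputation procedure, this sketch does not draw the regression coefficients from their posterior distribution, so it understates imputation uncertainty.

```python
import numpy as np


def fit_logistic(X, y, n_iter=25, ridge=1e-3):
    """Fit a logistic regression by Newton-Raphson with a small ridge penalty."""
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-(X @ beta)))
        w = p * (1.0 - p)
        grad = X.T @ (y - p) - ridge * beta
        hess = (X * w[:, None]).T @ X + ridge * np.eye(X.shape[1])
        beta = beta + np.linalg.solve(hess, grad)
    return beta


def sequential_impute(data, n_cycles=10, seed=0):
    """Return one completed copy of a 0/1 array in which np.nan marks missing."""
    rng = np.random.default_rng(seed)
    missing = np.isnan(data)
    filled = data.copy()

    # Starting values: resample observed entries of each incomplete column.
    for j in range(data.shape[1]):
        if missing[:, j].any():
            observed = data[~missing[:, j], j]
            filled[missing[:, j], j] = rng.choice(observed, size=missing[:, j].sum())

    # Cycle through the incomplete columns, regressing each on all the others.
    for _ in range(n_cycles):
        for j in range(data.shape[1]):
            if not missing[:, j].any():
                continue
            others = np.delete(filled, j, axis=1)
            X = np.column_stack([np.ones(len(filled)), others])
            obs_rows = ~missing[:, j]
            beta = fit_logistic(X[obs_rows], data[obs_rows, j])
            p = 1.0 / (1.0 + np.exp(-(X[~obs_rows] @ beta)))
            # Draw imputed values from the fitted Bernoulli probabilities.
            filled[~obs_rows, j] = rng.random(p.size) < p
    return filled


# Synthetic example (all values are simulated, not SEER data).
rng = np.random.default_rng(42)
n = 2000
older = (rng.random(n) < 0.5).astype(float)
er = (rng.random(n) < np.where(older == 1, 0.85, 0.65)).astype(float)
marker2 = (rng.random(n) < np.where(er == 1, 0.7, 0.3)).astype(float)

data = np.column_stack([older, er, marker2])
data[rng.random(n) < 0.2, 1] = np.nan  # about 20% of ER status missing
data[rng.random(n) < 0.2, 2] = np.nan  # about 20% of the second marker missing

completed = sequential_impute(data)
```

Calling `sequential_impute` with several different `seed` values would yield several completed data sets, which is the usual starting point for a multiple-imputation analysis.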