Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017;112(517):54-63.
doi: 10.1080/01621459.2016.1191499. Epub 2017 May 3.

Defining Cancer Subtypes With Distinctive Etiologic Profiles: An Application to the Epidemiology of Melanoma

Affiliations

Defining Cancer Subtypes With Distinctive Etiologic Profiles: An Application to the Epidemiology of Melanoma

Audrey Mauguen et al. J Am Stat Assoc. 2017.

Abstract

We showcase a novel analytic strategy to identify sub-types of cancer that possess distinctive causal factors, i.e. sub-types that are "etiologically" distinct. The method involves the integrated analysis of two types of study design: an incident series of cases with double primary cancers with detailed information on tumor characteristics that can be used to define the sub-types; a case-series of incident cases with information on known risk factors that can be used to investigate the specific risk factors that distinguish the sub-types. The methods are applied to a rich melanoma dataset with detailed information on pathologic tumor factors, and comprehensive information on known genetic and environmental risk factors for melanoma. Identification of the optimal sub-typing solution is accomplished using a novel clustering analysis that seeks to maximize a measure that characterizes the distinctiveness of the distributions of risk factors across the sub-types and that is a function of the correlations of tumor factors in the case-specific tumor pairs. This analysis is challenged by the presence of extensive missing data. If successful, studies of this nature offer the opportunity for efficient study design to identify unknown risk factors whose effects are concentrated in defined sub-types.

Keywords: case-control study; etiologic heterogeneity; k-means clustering; logistic regression; multiple imputation.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Schematic Representation of Clustering Strategy
Figure 1
Figure 1
Schematic Representation of Clustering Strategy
Figure 1
Figure 1
Schematic Representation of Clustering Strategy
Figure 2
Figure 2
Plot of Heterogeneity Measure Versus k-Means Distance Measure for Clustering Solutions

Similar articles

Cited by

References

    1. Althuis MD, Fergenbaum JH, Garcia-Closas M, Brinton LA, Madigan MP, Sherman ME. Etiology of hormone receptor-defined breast cancer: a systematic review of the literature. Cancer Epidemiology. Biomarkers and Prevention. 2004;13:1558–1568. - PubMed
    1. Bauer J, Büttner P, Murali R, Okamoto I, Kolaitis NA, Landi MT, Scolyer RA, Bastian BC. BRAF mutations in cutaneous melanoma are independently associated with age, anatomic site of the primary tumor, and the degree of solar elastosis at the primary tumor site. Pigment Cell and Melanoma Research. 2011;24(2):345–51. - PMC - PubMed
    1. Begg CB, Gray RJ. Calculation of polychotomous logistic regression parameters using individualized regressions. Biometrika 1984. 1984;71:11–18.
    1. Begg CB, Zhang ZF. Statistical analysis of molecular epidemiology studies employing case-series. Cancer Epidemiology, Biomarkers and Prevention. 1994;3:173–175. - PubMed
    1. Begg CB, Hummer AJ, Mujumdar U, Armstrong BK, Kricker A, Marrett LD, Millikan RC, Gruber SB, Culver HA, Zanetti R, Gallagher RP, Dwyer T, Rebbeck TR, Busam K, From L, Berwick M. A design for cancer case-control studies using only incident cases: experience with the GEM study of melanoma. International Journal of Epidemiology. 2006;35:756–764. - PubMed

Publication types