Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Dec 20;32(29):5039-52.
doi: 10.1002/sim.5902. Epub 2013 Jul 16.

A conceptual and methodological framework for investigating etiologic heterogeneity

Affiliations

A conceptual and methodological framework for investigating etiologic heterogeneity

Colin B Begg et al. Stat Med. .

Abstract

Cancer has traditionally been studied using the disease site of origin as the organizing framework. However, recent advances in molecular genetics have begun to challenge this taxonomy, as detailed molecular profiling of tumors has led to discoveries of subsets of tumors that have profiles that possess distinct clinical and biological characteristics. This is increasingly leading to research that seeks to investigate whether these subtypes of tumors have distinct etiologies. However, research in this field has been opportunistic and anecdotal, typically involving the comparison of distributions of individual risk factors between tumors classified on the basis of candidate tumor characteristics. The purpose of this article is to place this area of investigation within a more general conceptual and analytic framework, with a view to providing more efficient and practical strategies for designing and analyzing epidemiologic studies to investigate etiologic heterogeneity. We propose a formal definition of etiologic heterogeneity and show how classifications of tumor subtypes with larger etiologic heterogeneities inevitably possess greater disease risk predictability overall. We outline analytic strategies for estimating the degree of etiologic heterogeneity among a set of subtypes and for choosing subtypes that optimize the heterogeneity, and we discuss technical challenges that require further methodologic research. We illustrate the ideas by using a pooled case-control study of breast cancer classified by expression patterns of genes known to define distinct tumor subtypes.

Keywords: cancer epidemiology; clustering; etiologic heterogeneity.

PubMed Disclaimer

Figures

Figure 1
Figure 1
The figure displays conceptually the interplay of genetic and pathologic tumor characteristics (denoted by {yi}) and risk profiles (denoted by {xi}) that are used interactively to determine the set of subtypes that best characterize etiologic heterogeneity.
Figure 2
Figure 2
The incremental explained variation is plotted against the distance metric for every possible two-class configuration of the 16 distinct permutations of the four binary markers, excluding only configurations in which one subtype contains fewer than 5% of the cases (green dots). The red dots represent local maxima obtained from k-means clustering. The blue dot corresponds to the classification ER+ versus ER−.
Figure 3
Figure 3
This figure is similar to Figure 2, except that the green dots represent a random sample of 50,000 candidate configurations. The blue dot corresponds to the widely used four-class system: luminal A, luminal B, HER2-enriched and triple negative.

Similar articles

Cited by

References

    1. Harris TJ, McCormick F. The molecular pathology of cancer. Nature Reviews Clinical Oncology. 2010;7:251–265. - PubMed
    1. Barretina J, Taylor BS, Banerji S, Ramos AH, Lagos-Quintana M, Decarolis PL, et al. Subtype-specific genomic alterations define new targets for soft-tissue sarcoma therapy. Nature Genetics. 2010;42:715–721. - PMC - PubMed
    1. Taylor BS, Schultz N, Hieronymus H, Gopalan A, Xiao Y, Carver BS, et al. Integrative genomic profiling of human prostate cancer. Cancer Cell. 2010;18:11–22. - PMC - PubMed
    1. Hunter DJ, Kraft P, Jacobs KB, Cox DG, Yeager M, Hankinson SE, et al. A genome-wide association study identifies alleles in FGFR2 associated with risk of sporadic postmenopausal breast cancer. Nature Genetics. 2007;39:870–874. - PMC - PubMed
    1. Easton DF, Pooley KA, Dunning AM, Pharoah PD, Thompson D, Ballinger DG, et al. Genome-wide association study identifies novel breast cancer susceptibility loci. Nature. 2007;447:1087–1093. - PMC - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources