Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2022 Dec;46(8):555-571.
doi: 10.1002/gepi.22497. Epub 2022 Aug 4.

Genetic heterogeneity: Challenges, impacts, and methods through an associative lens

Affiliations
Review

Genetic heterogeneity: Challenges, impacts, and methods through an associative lens

Alexa A Woodward et al. Genet Epidemiol. 2022 Dec.

Abstract

Genetic heterogeneity describes the occurrence of the same or similar phenotypes through different genetic mechanisms in different individuals. Robustly characterizing and accounting for genetic heterogeneity is crucial to pursuing the goals of precision medicine, for discovering novel disease biomarkers, and for identifying targets for treatments. Failure to account for genetic heterogeneity may lead to missed associations and incorrect inferences. Thus, it is critical to review the impact of genetic heterogeneity on the design and analysis of population level genetic studies, aspects that are often overlooked in the literature. In this review, we first contextualize our approach to genetic heterogeneity by proposing a high-level categorization of heterogeneity into "feature," "outcome," and "associative" heterogeneity, drawing on perspectives from epidemiology and machine learning to illustrate distinctions between them. We highlight the unique nature of genetic heterogeneity as a heterogeneous pattern of association that warrants specific methodological considerations. We then focus on the challenges that preclude effective detection and characterization of genetic heterogeneity across a variety of epidemiological contexts. Finally, we discuss systems heterogeneity as an integrated approach to using genetic and other high-dimensional multi-omic data in complex disease research.

Keywords: GWAS; complex disease; genetic heterogeneity; precision medicine.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Conceptual illustrations contrasting homogeneity and heterogeneity using example observations within features, outcomes, or associations. Panel (a) depicts age as a feature with less variability on the left and with more variability on the right. Panel (b) depicts a phenotypic outcome, again with less variability on the left and more variability on the right. Panel (c) depicts subjects (rows) and features (columns) where features can have different values (shading). On the left, the feature highlighted by the dotted box is homogeneously associated with Disease X. On the right, associative heterogeneity is represented by two different features independently associated with Disease X within different groups of subjects.
Figure 2
Figure 2
Examples of two independent causes of a single phenotype under four scenarios, allelic heterogeneity, locus heterogeneity, phenocopy, and more complex examples of heterogeneity. Causal factors are in black.

Similar articles

Cited by

References

    1. Albiñana, C. , Grove, J. , McGrath, J. J. , Agerbo, E. , Wray, N. R. , Bulik, C. M. , Nordentoft, M. , Hougaard, D. M. , Werge, T. , Børglum, A. D. , Mortensen, P. B. , Privé, F. , & Vilhjálmsson, B. J. (2021). Leveraging both individual‐level genetic data and GWAS summary statistics increases polygenic prediction. The American Journal of Human Genetics, 108(6), 1001–1011. - PMC - PubMed
    1. Altman, N. , & Krzywinski, M. (2015). Points of significance: Association, correlation and causation. Nature Methods, 12(10), 899–900. - PubMed
    1. Argelaguet, R. , Velten, B. , Arnol, D. , Dietrich, S. , Zenz, T. , Marioni, J. C. , Buettner, F. , Huber, W. , & Stegle, O. (2018). Multi‐omics factor analysis—A framework for unsupervised integration of multi‐omics data sets. Molecular Systems Biology, 14(6), e8124. - PMC - PubMed
    1. Ashapkin, V. V. , Kutueva, L. I. , & Vanyushin, B. F. (2019). Epigenetic clock: Just a convenient marker or an active driver of aging? Reviews on Biomarker Studies in Aging and Anti‐Aging Research, 1178, 175–206. - PubMed
    1. Athey, S. , & Imbens, G. W. (2015). Machine learning methods for estimating heterogeneous causal effects. Stat, 1050(5), 1–26.

Publication types

LinkOut - more resources