Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2023 Dec;24(12):834-850.
doi: 10.1038/s41576-023-00620-x. Epub 2023 Jun 27.

Incongruence in the phylogenomics era

Affiliations
Review

Incongruence in the phylogenomics era

Jacob L Steenwyk et al. Nat Rev Genet. 2023 Dec.

Abstract

Genome-scale data and the development of novel statistical phylogenetic approaches have greatly aided the reconstruction of a broad sketch of the tree of life and resolved many of its branches. However, incongruence - the inference of conflicting evolutionary histories - remains pervasive in phylogenomic data, hampering our ability to reconstruct and interpret the tree of life. Biological factors, such as incomplete lineage sorting, horizontal gene transfer, hybridization, introgression, recombination and convergent molecular evolution, can lead to gene phylogenies that differ from the species tree. In addition, analytical factors, including stochastic, systematic and treatment errors, can drive incongruence. Here, we review these factors, discuss methodological advances to identify and handle incongruence, and highlight avenues for future research.

PubMed Disclaimer

Conflict of interest statement

Competing interests

J.L.S. is a scientific consultant for Latch AI Inc. J.L.S. is a scientific advisor for WittGen Biotechnologies. J.L.S. is an advisor for ForensisGroup Inc. A.R. is a scientific consultant for LifeMine Therapeutics, Inc.

Figures

Figure 1 |
Figure 1 |. Incongruence at different levels of genomic organization.
a | The topology shown in blue supports a sister group relationship of taxa A and B, whereas the orange topology supports a sister group relationship of taxa A and C. The inference of such conflicting topologies defines incongruence. Incongruence can occur at different levels in the genome, such as among c | whole chromosomes (e.g., analyses of one chromosome support the blue topology but analyses of another support the orange topology), d | regions of a chromosome (dark grey regions represent lack of homology), e | genes (or loci), f | within a gene or locus (e.g., different domains support different topologies), and g | among sites in a multiple sequence alignment. Note that incongruence is also prevalent in other types of data (e.g., behavioral or morphological traits) and can occur at all evolutionary depths.
Figure 2 |
Figure 2 |. Major biological factors that contribute to incongruence.
Incomplete lineage sorting can lead to to gene trees that differ from the species phylogeny due to variation in the sorting of ancestral polymorphisms. Horizontal gene transfer, hybridization, and introgression can all lead to gene phylogenies that differ from the species tree. Recombination can result in loci with chimeric evolutionary histories. Duplication and loss can lead to hidden paralogy. Independently evolved traits in different phylogenetic lineages can be associated with convergent molecular evolution (green), contributing to incongruence.
Figure 3 |
Figure 3 |. Analytical factors can contribute to incongruence at every step in a phylogenomic workflow.
a | Taxon sampling can impact all downstream analyses in phylogenomic studies. b | During orthology inference, biases (e.g., sequence length biases) and analytical errors (e.g., erroneous orthology inferences) can contribute to incongruence. Each color corresponds to a unique ortholog present in each of the four taxa. c | Misalignment and excessive trimming of individual groups of orthologous genes can further decrease the accuracy of phylogenetic inferences. An example of erroneous ortholog inclusion is depicted using red font. d | Species tree inference via concatenation (left) or coalescence (right) is susceptible to multiple additional sources of error—complexity of model space, model misspecification, and inadequate model complexity, to name just a few.

References

    1. Simpson GG, 1902-. The principles of classification and a classification of mammals. ([American Museum of Natural History; ], 1945).
    1. Jarvis ED et al. Whole-genome analyses resolve early branches in the tree of life of modern birds. Science 346, 1320–1331 (2014). - PMC - PubMed
    1. Parks DH et al. A standardized bacterial taxonomy based on genome phylogeny substantially revises the tree of life. Nature Biotechnology 36, 996–1004 (2018). - PubMed
    1. One Thousand Plant Transcriptomes Initiative. One thousand plant transcriptomes and the phylogenomics of green plants. Nature 574, 679–685 (2019). - PMC - PubMed
    1. Li Y et al. HGT is widespread in insects and contributes to male courtship in lepidopterans. Cell 185, 2975–2987.e10 (2022). - PMC - PubMed

Publication types

LinkOut - more resources