Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Jul 1;69(4):660-670.
doi: 10.1093/sysbio/syz080.

The Implications of Lineage-Specific Rates for Divergence Time Estimation

Affiliations

The Implications of Lineage-Specific Rates for Divergence Time Estimation

Tom Carruthers et al. Syst Biol. .

Abstract

Rate variation adds considerable complexity to divergence time estimation in molecular phylogenies. Here, we evaluate the impact of lineage-specific rates-which we define as among-branch-rate-variation that acts consistently across the entire genome. We compare its impact to residual rates-defined as among-branch-rate-variation that shows a different pattern of rate variation at each sampled locus, and gene-specific rates-defined as variation in the average rate across all branches at each sampled locus. We show that lineage-specific rates lead to erroneous divergence time estimates, regardless of how many loci are sampled. Further, we show that stronger lineage-specific rates lead to increasing error. This contrasts to residual rates and gene-specific rates, where sampling more loci significantly reduces error. If divergence times are inferred in a Bayesian framework, we highlight that error caused by lineage-specific rates significantly reduces the probability that the 95% highest posterior density includes the correct value, and leads to sensitivity to the prior. Use of a more complex rate prior-which has recently been proposed to model rate variation more accurately-does not affect these conclusions. Finally, we show that the scale of lineage-specific rates used in our simulation experiments is comparable to that of an empirical data set for the angiosperm genus Ipomoea. Taken together, our findings demonstrate that lineage-specific rates cause error in divergence time estimates, and that this error is not overcome by analyzing genomic scale multilocus data sets. [Divergence time estimation; error; rate variation.].

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
A summary of different classes of molecular evolutionary rate variation. The trees shown are phylograms, and in all cases taxa are sampled at the present. In all trees, the two divergence events occurred at time formula image in the past, and time formula image in the past. Therefore, branch length variation reflects molecular evolutionary rate variation. a) Lineage-specific rates. Among-branch-rate-variation is observed in each gene tree, but the pattern of among-branch-rate-variation is the same for each gene tree. The average rate for each branch therefore differs in the species tree. b) Gene-specific rates. The average rate across all branches differs for each gene tree but does so consistently for all branches. The average rate for each branch is therefore the same in the species tree. c) Residual rates. Among-branch-rate-variation is observed in all gene trees, but the pattern of among-branch-rate-variation is different for each gene tree. If a large number of loci are sampled, the average rate for each branch is the same in the species tree. d) Residual rates and lineage-specific rates. Because of the underlying lineage-specific rate, the average rate for each branch differs in the species tree.
Figure 2.
Figure 2.
Summary of results from simulation experiments with different classes of rate variation and different numbers of sampled loci (each locus has a length of 800 bp). The unknown divergence time is inferred with a UCLN relaxed clock. Mean posterior age estimates from each experiment are plotted within bins of 0.2. The length of each plotted line corresponds to the number of estimates within each bin (see key). The line is plotted in gray if for more than 50% of replicate experiments plotted within a bin, the 95% HPD did not include the correct value. Otherwise the line is plotted in black. a) No rate variation, b) gene-specific rates, c) residual rates, d) lineage-specific rates, and d) lineage-specific rates and residual rates.
Figure 3.
Figure 3.
The impact of increasingly strong lineage-specific rates on divergence time estimates. Points are plotted according to the same bins, sizes, and colors as Figure 2. In all cases, the unknown divergence time is inferred from 400 loci. a) formula image of the UCLN relaxed clock is corrected such that it exactly matches formula image of the distribution from which lineage-specific rates are simulated. b) formula image of the UCLN relaxed clock is fixed at 0.15.

Similar articles

Cited by

References

    1. Alfaro M.E., Faircloth B.C., Harrington R.C., Sorenson L., Friedman M., Thacker C.E., Oliveros C.H., Černý D., Near T.J.. 2018. Explosive diversification of marine fishes at the Cretaceous-Palaeogene boundary. Nat. Ecol. Evol. 2:688–696. - PubMed
    1. Aris-Brosou S., Yang Z.. 2003. Bayesian models of episodic evolution support a late precambrian explosive diversification of the metazoa. Mol. Biol. Evol. 20:1947–1954. - PubMed
    1. Berv J.S., Field D.J.. 2018. Genomic signature of an avian Lilliput effect across the K-Pg extinction. Syst. Biol. 67:1–13. - PMC - PubMed
    1. Bolker B., Butler M., Cowan P., de Vienne D., Eddelbuettel D., Holder M., Jombart T., Kembel S., Michonneau F., Orme D., O’Meara B., Paradis E., Regetz J., Zwickl D.. 2017. phylobase: base package for phylogenetic structures and comparative data. R package version 0.8.4 Available from: https://CRAN.R-project.org/package=phylobase.
    1. Britten R.J. 1984. Rates of DNA sequence evolution differ between taxonomic groups. Science 39:1393–1398. - PubMed

Publication types