Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Sep;1(9):1370-1378.
doi: 10.1038/s41559-017-0240-5. Epub 2017 Jul 24.

Phylotranscriptomic consolidation of the jawed vertebrate timetree

Affiliations

Phylotranscriptomic consolidation of the jawed vertebrate timetree

Iker Irisarri et al. Nat Ecol Evol. 2017 Sep.

Abstract

Phylogenomics is extremely powerful but introduces new challenges as no agreement exists on "standards" for data selection, curation and tree inference. We use jawed vertebrates (Gnathostomata) as model to address these issues. Despite considerable efforts in resolving their evolutionary history and macroevolution, few studies have included a full phylogenetic diversity of gnathostomes and some relationships remain controversial. We tested a novel bioinformatic pipeline to assemble large and accurate phylogenomic datasets from RNA sequencing and find this phylotranscriptomic approach successful and highly cost-effective. Increased sequencing effort up to ca. 10Gbp allows recovering more genes, but shallower sequencing (1.5Gbp) is sufficient to obtain thousands of full-length orthologous transcripts. We reconstruct a robust and strongly supported timetree of jawed vertebrates using 7,189 nuclear genes from 100 taxa, including 23 new transcriptomes from previously unsampled key species. Gene jackknifing of genomic data corroborates the robustness of our tree and allows calculating genome-wide divergence times by overcoming gene sampling bias. Mitochondrial genomes prove insufficient to resolve the deepest relationships because of limited signal and among-lineage rate heterogeneity. Our analyses emphasize the importance of large curated nuclear datasets to increase the accuracy of phylogenomics and provide a reference framework for the evolutionary history of jawed vertebrates.

Keywords: Gnathostomata; RNA-Seq; cross-validation; jackknifing; molecular dating; phylogeny; substitution rates; transcriptome.

PubMed Disclaimer

Conflict of interest statement

Competing financial interests The authors declare no competing financial interests.

Figures

Figure 1
Figure 1
Transcriptome sequencing effort and performance in phylogenomic dataset assembly. Histogram represent sequencing effort as total number of sequenced (clean) Mbp (million bp). Transcriptome completeness is measured as the proportion of recovered core vertebrate genes (233 CVG; Hara et al.29). Genes effectively usable for phylogenomics are approximated by the proportion of human proteins reconstructed at full (100%) and nearly full (>70%) lengths (in proportion to a total of 22,964 human genes). The completeness the relevant species in our final phylogenomic dataset is shown as the proportion of amino acids across all 7,189 genes (3,791,500 aligned amino acids in total).
Figure 2
Figure 2
Backbone phylogeny of jawed vertebrates. (a) Bayesian majority-rule consensus tree from 100 independent MCMC chains derived from gene jackknife replicates (~50,000 amino acid positions each) of the NoDP nuclear dataset, estimated by PhyloBayes under the CAT+Γ model. All nodes received full gene jackknife support (100%), except those displaying the actual value. The scale bar corresponds to the expected number of substitutions per site. Asterisks denote new transcriptomic data generated in this study. (b) Effect of alignment length on the recovery of single nodes in the phylogeny assessed by gene jackknife proportions derived from the NoDP dataset.
Figure 3
Figure 3
Time-calibrated phylogeny of jawed vertebrates. Divergences have been averaged across 100 timetrees estimated from independent gene jackknife replicates in PhyloBayes, using the subset of most congruent calibrations (C16; marked by arrows) and best-fit evolutionary (CAT-GTR+Γ) and relaxed clock (autocorrelated lognormal) models. Credibility intervals (CrI) are calculated as the absolute maximum and minimum values of 95% confidence intervals across 100 timetrees (only displayed for key nodes; see Supplementary Table 9 for detailed results). The dimensions of the scale is given in million years and main geological periods are highlighted.

References

    1. Brown JM, Thomson RC. Bayes factors unmask highly variable information content, bias, and extreme influence in phylogenomic analyses. Syst Biol. 2017;66:517–530. doi: 10.1093/sysbio/syw101. - DOI - PubMed
    1. Jeffroy O, Brinkmann H, Delsuc F, Hervé P. Phylogenomics: The beginning of incongruence? Trends Genet. 2006;22:225–231. - PubMed
    1. Zardoya R, Meyer A. The evolutionary position of turtles revised. Naturwissenschaften. 2001;88:193–200. - PubMed
    1. Janke A, Erpenbeck D, Nilsson M, Arnason U. The mitochondrial genomes of the iguana (Iguana iguana) and the caiman (Caiman crocodylus): Implications for amniote phylogeny. Proc R Soc B-Biol Sci. 2001;268:623–631. - PMC - PubMed
    1. Near TJ, et al. Phylogeny and tempo of diversification in the superradiation of spiny-rayed fishes. Proc Natl Acad Sci USA. 2013;110:12738–12743. - PMC - PubMed