Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2021 Mar 9;38(3):891-903.
doi: 10.1093/molbev/msaa251.

Interrogating Genomic-Scale Data to Resolve Recalcitrant Nodes in the Spider Tree of Life

Affiliations
Comparative Study

Interrogating Genomic-Scale Data to Resolve Recalcitrant Nodes in the Spider Tree of Life

Siddharth Kulkarni et al. Mol Biol Evol. .

Abstract

Genome-scale data sets are converging on robust, stable phylogenetic hypotheses for many lineages; however, some nodes have shown disagreement across classes of data. We use spiders (Araneae) as a system to identify the causes of incongruence in phylogenetic signal between three classes of data: exons (as in phylotranscriptomics), noncoding regions (included in ultraconserved elements [UCE] analyses), and a combination of both (as in UCE analyses). Gene orthologs, coded as amino acids and nucleotides (with and without third codon positions), were generated by querying published transcriptomes for UCEs, recovering 1,931 UCE loci (codingUCEs). We expected that congeners represented in the codingUCE and UCEs data would form clades in the presence of phylogenetic signal. Noncoding regions derived from UCE sequences were recovered to test the stability of relationships. Phylogenetic relationships resulting from all analyses were largely congruent. All nucleotide data sets from transcriptomes, UCEs, or a combination of both recovered similar topologies in contrast with results from transcriptomes analyzed as amino acids. Most relationships inferred from low-occupancy data sets, containing several hundreds of loci, were congruent across Araneae, as opposed to high occupancy data matrices with fewer loci, which showed more variation. Furthermore, we found that low-occupancy data sets analyzed as nucleotides (as is typical of UCE data sets) can result in more congruent relationships than high occupancy data sets analyzed as amino acids (as in phylotranscriptomics). Thus, omitting data, through amino acid translation or via retention of only high occupancy loci, may have a deleterious effect in phylogenetic reconstruction.

Keywords: Araneae; noncoding regions; phylogeny; target-capture; transcriptomics.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Maximum likelihood phylogeny of spiders resulting from the AllUCEs10 data set (occupancy 10, 1,060 loci) collapsed to family level. Paraphyly is indicated by vertical violet bars. (A) All major lineages of spiders at family level except the RTA Clade and Araneoidea; (B) RTA Clade; (C) All 17 families of superfamily Araneoidea. The rhombi at the nodes indicate four support values: Shimodaira–Hasegawa-like approximate likelihood ratio test (left top), ultrafast bootstrap (right top), gene concordance factor (gCF) (left bottom), and site concordance factor (sCF) (right bottom). The numbers at the node indicate clades as described. Branch lengths are not to be scaled. For the original sampled tree, see supplementary figure 2, Supplementary Material online.
Fig. 2.
Fig. 2.
Maximum likelihood phylogenies of spiders resulting from different data sets at various occupancies. Each colored box indicates a data set corresponding to supplementary table 2, Supplementary Material online. The first and second rows represent phylogenies resulting from data analyzed as nucleotides and amino acids, respectively, of codingUCEs (outlined red) and AllUCEs (outlined blue).
Fig. 3.
Fig. 3.
Comparison of phylogenetic relationships between (A) transcriptomic phylogeny as published by Fernández et al. (2018) using amino acids, and (B) nucT (Fernández et al. 2018, transcriptome data set analyzed as nucleotides). Both phylogenies were constructed using occupancy of 67%. The highlighted blue box indicates Araneoidea families.
Fig. 4.
Fig. 4.
Comparison of interfamilial relationships of Araneoidea. (A) AllAAUCEs tree, (B) AllUCEs tree. Occupancy of both phylogenies was 10%. Colored branches indicate family relationships that are congruent in both trees.
Fig. 5.
Fig. 5.
Schematic representation of data classes analyzed in this study in a maximum likelihood framework. Squares indicate original data sets from Fernández et al. (2018) and Kulkarni et al. (2020), and circles indicate matrices analyzed in our study. Circles with red outline indicate amino acid data set, black outline indicates noncoding region data set, and the circles with outline indicate nucleotide data sets. UCE, ultraconserved elements.

Similar articles

Cited by

References

    1. Arnedo MA, Hormiga G, Scharff N.. 2009. Higher-level phylogenetics of linyphiid spiders (Araneae, Linyphiidae) based on morphological and molecular evidence. Cladistics 25(3):231–262. - PubMed
    1. Babb PL, Lahens NF, Correa-Garhwal SM, Nicholson DN, Kim EJ, Hogenesch JB, Kuntner M, Higgins L, Hayashi CY, Agnarsson I, et al.2017. The Nephila clavipes genome highlights the diversity of spider silk genes and their complex expression. Nat Genet. 49(6):895–903. - PubMed
    1. Ballesteros JA, Sharma P.2019. A critical appraisal of the placement of Xiphosura (Chelicerata) with account of known sources of phylogenetic error. Syst Biol. 68(6):896–917. - PubMed
    1. Bond JE, Garrison NL, Hamilton CA, Godwin RL, Hedin M, Agnarsson I.. 2014. Phylogenomics resolves a spider backbone phylogeny and rejects a prevailing paradigm for orb web evolution. Curr Biol. 24(15):1765–1771. - PubMed
    1. Bossert S, Murray EA, Almeida EAB, Brady SG, Blaimer BB, Danforth BN.. 2019. Combining transcriptomes and ultraconserved elements to illuminate the phylogeny of Apidae. Mol Phylogenet Evol. 130:121–131. - PubMed

Publication types