Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 May 1;17(1):124.
doi: 10.1186/s13104-024-06786-9.

18S rDNA sequence-structure phylogeny of the eukaryotes simultaneously inferred from sequences and their individual secondary structures

Affiliations

18S rDNA sequence-structure phylogeny of the eukaryotes simultaneously inferred from sequences and their individual secondary structures

Eva Rapp et al. BMC Res Notes. .

Abstract

Objective: The eukaryotic tree of life has been subject of numerous studies ever since the nineteenth century, with more supergroups and their sister relations being decoded in the last years. In this study, we reconstructed the phylogeny of eukaryotes using complete 18S rDNA sequences and their individual secondary structures simultaneously. After the sequence-structure data was encoded, it was automatically aligned and analyzed using sequence-only as well as sequence-structure approaches. We present overall neighbor-joining trees of 211 eukaryotes as well as the respective profile neighbor-joining trees, which helped to resolve the basal branching pattern. A manually chosen subset was further inspected using neighbor-joining, maximum parsimony, and maximum likelihood analyses. Additionally, the 75 and 100 percent consensus structures of the subset were predicted.

Results: All sequence-structure approaches show improvements compared to the respective sequence-only approaches: the average bootstrap support per node of the sequence-structure profile neighbor-joining analyses with 90.3, was higher than the average bootstrap support of the sequence-only profile neighbor-joining analysis with 73.9. Also, the subset analyses using sequence-structure data were better supported. Furthermore, more subgroups of the supergroups were recovered as monophyletic and sister group relations were much more comparable to results as obtained by multi-marker analyses.

Keywords: 18S rDNA; Eukaryotes; Phylogenetics; Secondary structure.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
Left: Encoding of sequence-structure information. Scoring matrices and substitution models have been adapted accordingly. The figure shows an RNA sequence with its individual secondary structure in the bracket-dot-bracket notation. The respective 2D structure, the 12-letter translation table as well as the one-letter-encoded pseudoprotein sequence are depicted. Right: Different alignments are shown. They differ in terms of informational content (exemplarily highlighted in red). Only the sequence-structure-alignments as derived from 4SALE [–22] include information about individual secondary structures whereas the guided-sequence alignment is guided only by a consensus structure
Fig. 2
Fig. 2
Overall sequence-structure NJ tree using the 18S rDNA of all 211 taxa. ClustalW [19], as implemented in 4SALE [20, 21], was used for the global multiple sequence-structure alignment. The tree was reconstructed using ProfDistS [25, 26] and midpoint rooted. The scale bar shows evolutionary distances. Taxa names are accompanied by their corresponding GenBank accession number. Clades and respective singular taxa are marked in a color-scheme based on the eukaryotic tree of life published by Keeling and Burki [2]. If clades and singular taxa do not form one monophyletic group, they are numbered consecutively. If a group is only represented by one taxon, the taxon is marked in red. Taxa which were manually selected for the subsampling are marked bold. Supergroups are indicated according to Burki et al. [3] and Keeling and Burki [2], the names of the supergroups are adapted based on Adl. et al. [27]. With regards to readability the supergroups Amorphea, Obazoa and Opisthokonta are only named once near the biggest monophyletic subgroup. The three supergroups are marked with quotation marks since they are not monophyletic. The supergroup Opisthokonta includes Fungi, Metazoa, Choanoflagellata and Ichthyosporea. Amoebozoa are classified as Obazoa
Fig. 3
Fig. 3
Two-times iterated sequence-structure PNJ tree with BS values (A) and original BL (B). Profiles were predefined according to Fig. 2; singletons were not included. The scale bar shows evolutionary distances. The trees were reconstructed using ProfDistS [25, 26] and rooted according to the overall sequence-structure NJ tree (Fig. 2). In each iteration, super-profiles of profiles have been built based on BS values (> 75). At internodes, the BS values from 100 pseudo-replicates have been mapped. The numbers in the triangles in front of the taxa represent the quantity of taxa included in the profile

Similar articles

References

    1. Simpson AGB, Roger AJ. The real 'Kingdoms' of eukaryotes. Curr Biol. 2004 doi: 10.1016/j.cub.2004.08.038. - DOI - PubMed
    1. Keeling PJ, Burki F. Progress towards the Tree of Eukaryotes. Curr Biol. 2019 doi: 10.1016/j.cub.2019.07.031. - DOI - PubMed
    1. Burki F, Roger AJ, Brown MW, Simpson AGB. The new tree of eukaryotes. Trends Ecol Evol. 2020 doi: 10.1016/j.tree.2019.08.008. - DOI - PubMed
    1. Xie Q, Lin J, Qin Y, Zhou J, Bu W. Structural diversity of eukaryotic 18S rRNA and its impact on alignment and phylogenetic reconstruction. Protein Cell. 2011 doi: 10.1007/s13238-011-1017-2. - DOI - PMC - PubMed
    1. Sayers EW, Bolton EE, Brister JR, Canese K, Chan J, Comeau DC, et al. Database resources of the national center for biotechnology information. Nucleic Acids Res. 2022 doi: 10.1093/nar/gkab1112. - DOI - PMC - PubMed

LinkOut - more resources