Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Feb 21:11:22.
doi: 10.3389/fpls.2020.00022. eCollection 2020.

Plastome Evolution and Phylogeny of Orchidaceae, With 24 New Sequences

Affiliations

Plastome Evolution and Phylogeny of Orchidaceae, With 24 New Sequences

Young-Kee Kim et al. Front Plant Sci. .

Erratum in

Abstract

In order to understand the evolution of the orchid plastome, we annotated and compared 124 complete plastomes of Orchidaceae representing all the major lineages in their structures, gene contents, gene rearrangements, and IR contractions/expansions. Forty-two of these plastomes were generated from the corresponding author's laboratory, and 24 plastomes-including nine genera (Amitostigma, Bulbophyllum, Dactylorhiza, Dipodium, Galearis, Gymnadenia, Hetaeria, Oreorchis, and Sedirea)-are new in this study. All orchid plastomes, except Aphyllorchis montana, Epipogium aphyllum, and Gastrodia elata, have a quadripartite structure consisting of a large single copy (LSC), two inverted repeats (IRs), and a small single copy (SSC) region. The IR region was completely lost in the A. montana and G. elata plastomes. The SSC is lost in the E. aphyllum plastome. The smallest plastome size was 19,047 bp, in E. roseum, and the largest plastome size was 178,131 bp, in Cypripedium formosanum. The small plastome sizes are primarily the result of gene losses associated with mycoheterotrophic habitats, while the large plastome sizes are due to the expansion of noncoding regions. The minimal number of common genes among orchid plastomes to maintain minimal plastome activity was 15, including the three subunits of rpl (14, 16, and 36), seven subunits of rps (2, 3, 4, 7, 8, 11, and 14), three subunits of rrn (5, 16, and 23), trnC-GCA, and clpP genes. Three stages of gene loss were observed among the orchid plastomes. The first was ndh gene loss, which is widespread in Apostasioideae, Vanilloideae, Cypripedioideae, and Epidendroideae, but rare in the Orchidoideae. The second stage was the loss of photosynthetic genes (atp, pet, psa, and psb) and rpo gene subunits, which are restricted to Aphyllorchis, Hetaeria, Hexalectris, and some species of Corallorhiza and Neottia. The third stage was gene loss related to prokaryotic gene expression (rpl, rps, trn, and others), which was observed in Epipogium, Gastrodia, Lecanorchis, and Rhizanthella. In addition, an intermediate stage between the second and third stage was observed in Cyrtosia (Vanilloideae). The majority of intron losses are associated with the loss of their corresponding genes. In some orchid taxa, however, introns have been lost in rpl16, rps16, and clpP(2) without their corresponding gene being lost. A total of 104 gene rearrangements were counted when comparing 116 orchid plastomes. Among them, many were concentrated near the IRa/b-SSC junction area. The plastome phylogeny of 124 orchid species confirmed the relationship of {Apostasioideae [Vanilloideae (Cypripedioideae (Orchidoideae, Epidendroideae))]} at the subfamily level and the phylogenetic relationships of 17 tribes were also established. Molecular clock analysis based on the whole plastome sequences suggested that Orchidaceae diverged from its sister family 99.2 mya, and the estimated divergence times of five subfamilies are as follows: Apostasioideae (79.91 mya), Vanilloideae (69.84 mya), Cypripedioideae (64.97 mya), Orchidoideae (59.16 mya), and Epidendroideae (59.16 mya). We also released the first nuclear ribosomal (nr) DNA unit (18S-ITS1-5.8S-ITS2-28S-NTS-ETS) sequences for the 42 species of Orchidaceae. Finally, the phylogenetic tree based on the nrDNA unit sequences is compared to the tree based on the 42 identical plastome sequences, and the differences between the two datasets are discussed in this paper.

Keywords: IR contraction/expansion; Orchidaceae; gene loss; genome rearrangement; plastome evolution.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Plastome length variation in 24 newly sequenced orchid species. The plastome of Gastrodia elata is 35,056 bp long and consists of only one single copy region. The species names in red indicate mycoheterotrophic species.
Figure 2
Figure 2
Diagram of nuclear ribosomal (nr) DNA repeat units consisting of 18S-ITS1-5.8S-ITS2-28S-NTS-ETS. The total length of the unit is approximately 10 kb, and it is arranged as a tandem repeat. The nrDNA repeat sequences of 42 orchid species were first reported in this paper.
Figure 3
Figure 3
Six representative plastomes of Orchidaceae. Cypripedium formosanum has the longest plastome (178.1 kb), while Epipogium roseum has the shortest. Gastrodia elata and Epipogium roseum hold 27 genes in their plastome, even though they have substantially different plastome sizes. The heavy vertical bars indicate inverted repeat (IR) regions and the broken lines among plastomes indicate the boundaries of IR and single copy regions.
Figure 4
Figure 4
Relationships between plastome lengths and gene numbers. (A): Terrestrial orchids show a wider range of variation than epiphytic orchids. (B): Mycoheterotrophic orchids show a wider range of variation than photosynthetic orchids. (C, D): Plastome lengths are more strongly correlated with LSC lengths than IR lengths in both mycoheterotrophic and photosynthetic orchids.
Figure 5
Figure 5
Distribution patterns of gene loss in Orchidaceae. The dark blue, light blue, and white blocks indicate presence, pseudogene, and absence of each gene, respectively. The non-functionalization of 11 ndh genes are distributed widely across all taxonomic groups of Orchidaceae. This frequently occurs in Epidendroideae and Vanilloideae. The non-functionalization of psa, psb, pet, and rpo gene classes are confined to mycoheterotrophic lineages. In addition, the loss of housekeeping genes such as the rps, rpl, or trn gene classes occur independently in four genera, Epipogium, Gastrodia, Lecanorchis, and Rhizanthella. The species names in red indicate mycoheterotrophic orchids.
Figure 6
Figure 6
Evolution of gene losses in the phylogenetic tree of Orchidaceae. A total of 129 taxa—124 Orchidaceae and five outgroup taxa—were the subject of tree reconstruction. The sequences of 83 protein coding genes were concatenated to a length of 87,399 bp. A maximum likelihood (ML) tree was constructed using RaxML-HPC2 with a GTR+G+I model (ML = −672774.637133 of ML optimization likelihood). All the genes were then plotted on the tree node using parsimony criteria under the condition of no parallel gains of the same gene. The species names in red indicate mycoheterotrophic orchids. (A): The basal portion of the tree showing the subfamilies Apostasioideae, Vanilloideae, Cypripedioideae, and Orchidoideae. (B): The upper portions of the tree showing the subfamily Epidendroideae.
Figure 7
Figure 7
Distribution patterns of the loss of 20 introns in Orchidaceae. The dark blue, light blue, and white blocks indicate intron presence, absence without corresponding gene loss, and absence with corresponding gene loss, respectively.
Figure 8
Figure 8
Distribution of gene blocks. A total of 134 gene blocks were recognized in the 124 orchid plastomes. Thirty were constant blocks. 104 blocks show both forward (dark blue) and reverse (red) orientations and confirmed the block rearrangements. Blocks that are non-applicable because of gene losses or shifts are indicated in gray. All large rearrangements longer than 15 kb are located on the LSC. The species names in red indicate mycoheterotrophic orchids. (A): Forward and reverse orientations of each gene block in the orchid species. (B): Locations, lengths, and frequencies of 134 gene blocks.
Figure 9
Figure 9
Comparison of a plastid tree and nrDNA tree for the same 42 orchid species. The 83 aligned protein coding genes were 80,798 bp long. A maximum likelihood (ML) tree was constructed using RaxML-HPC2 with a GTR+G+I model with 100 bootstrap replicates. The best plastid tree was obtained with ML = −332414.394814. The nrDNA unit (18S-ITS1-5.8S-ITS2-28S) was approximately 10 kb long. The tree reconstruction methods for nrDNA were identical to those of the plastid tree. The species names in red indicate mycoheterotrophic orchids. The lines between the two trees indicate the topological differences between them.
Figure 10
Figure 10
Fossil data showing the estimated divergence time of each node. Three fossil data were used to calibrate nodes (Asparagales—mean 105.3 mya, Dendrobium—23.2 mya, and Goodyera—15.0 mya). Orchidaceae diverged from its sister family at 99.20 mya, and then five subfamilies subsequently diverged in the order of Apostasioideae (79.91 mya), Vanilloideae (69.84 mya), Cypripedioideae (64.97 mya), Orchidoideae (59.16 mya), and Epidendroideae (59.16 mya). However, several specious subtribes within the Epidendroideae diverged in relatively short time periods (39.57–28.13 mya). The species names in red indicate mycoheterotrophic orchids.

References

    1. Altschul S. F., Madden T. L., Schäffer A. A., Zhang J., Zhang Z., Miller W., et al. (1997). Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389–3402. 10.1093/nar/25.17.3389 - DOI - PMC - PubMed
    1. Amiryousefi A., Hyvönen J., Poczai P. (2017). The plastid genome of Vanillon (Vanilla pompona, Orchidaceae). Mitochondrial DNA Part B 2, 689–691. 10.1080/23802359.2017.1383201 - DOI - PMC - PubMed
    1. Amiryousefi A., Hyvo J., Poczai P. (2018). Genome analysis IRscope: an online program to visualize the junction sites of chloroplast genomes. Bioinformatics 34, 3030–3031. 10.1093/bioinformatics/bty220 - DOI - PubMed
    1. Bankevich A., Nurk S., Antipov D., Gurevich A. A., Dvorkin M., Kulikov A. S., et al. (2012). SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J. Comput. Biol. 19, 455–477. 10.1089/cmb.2012.0021 - DOI - PMC - PubMed
    1. Barrett C. F., Davis J. I. (2012). The plastid genome of the mycoheterotrophic Corallorhiza striata (Orchidaceae) is in the relatively early stages of degradation. Am. J. Bot. 99, 1513–1523. 10.3732/ajb.1200256 - DOI - PubMed