Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2017 Aug 1;12(8):e0182281.
doi: 10.1371/journal.pone.0182281. eCollection 2017.

Comparative analysis of complete plastid genomes from wild soybean (Glycine soja) and nine other Glycine species

Affiliations
Comparative Study

Comparative analysis of complete plastid genomes from wild soybean (Glycine soja) and nine other Glycine species

Sajjad Asaf et al. PLoS One. .

Abstract

The plastid genomes of different plant species exhibit significant variation, thereby providing valuable markers for exploring evolutionary relationships and population genetics. Glycine soja (wild soybean) is recognized as the wild ancestor of cultivated soybean (G. max), representing a valuable genetic resource for soybean breeding programmes. In the present study, the complete plastid genome of G. soja was sequenced using Illumina paired-end sequencing and then compared it for the first time with previously reported plastid genome sequences from nine other Glycine species. The G. soja plastid genome was 152,224 bp in length and possessed a typical quadripartite structure, consisting of a pair of inverted repeats (IRa/IRb; 25,574 bp) separated by small (178,963 bp) and large (83,181 bp) single-copy regions, with a 51-kb inversion in the large single-copy region. The genome encoded 134 genes, including 87 protein-coding genes, eight ribosomal RNA genes, and 39 transfer RNA genes, and possessed 204 randomly distributed microsatellites, including 15 forward, 25 tandem, and 34 palindromic repeats. Whole-plastid genome comparisons revealed an overall high degree of sequence similarity between G. max and G. gracilis and some divergence in the intergenic spacers of other species. Greater numbers of indels and SNP substitutions were observed compared with G. cyrtoloba. The sequence of the accD gene from G. soja was highly divergent from those of the other species except for G. max and G. gracilis. Phylogenomic analyses of the complete plastid genomes and 76 shared genes yielded an identical topology and indicated that G. soja is closely related to G. max and G. gracilis. The complete G. soja genome sequenced in the present study is a valuable resource for investigating the population and evolutionary genetics of Glycine species and can be used to identify related species.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have no competing interest.

Figures

Fig 1
Fig 1. Gene map of the Glycine soja plastid genome.
Thick lines in the red area indicate the extent of the inverted repeat regions (IRa and IRb; 25,574 bp), which separate the genome into small (SSC; 17,896 bp) and large (LSC; 83,181 bp) single-copy regions. Genes located inside the circle are transcribed clockwise, and those outside the circle are transcribed counterclockwise. Genes belonging to different functional groups are colour-coded. The dark grey in the inner circle corresponds to the GC content, and the light grey corresponds to the AT content. The green colour arc indicates the location of the 51-kb inversion.
Fig 2
Fig 2. Analysis of repeated sequences in 10 Glycine plastid genomes.
A, Total of three repeat types; B, Length distribution of forward repeat sequences; C, Length distribution of tandem repeat sequences; D, Length distribution of palindromic repeat sequences.
Fig 3
Fig 3. Analysis of simple sequence repeats (SSRs) in the ten Glycine plastid genomes.
A, Number of SSR types; B, Frequency of identified SSR motifs in different repeat class types; C, Frequency of identified SSRs in coding regions; D, Frequency of identified SSRs in the small single-copy (SSC), large simple-copy (LSC), and inverted repeat (IR) regions.
Fig 4
Fig 4. Visual alignment of plastid genomes from Glycine soja (new and old) and nine other Glycine species.
VISTA-based identity plot showing the sequence identity among the ten Glycine species, using G. soja (new) as a reference. Vertical scale indicates the percentage of identity, ranging from 50% to 100%. Horizontal axis indicates the coordinates within the chloroplast genome. Arrows indicate the annotated genes and their transcriptional direction. A thick black line indicates the inverted repeat (IR) regions.
Fig 5
Fig 5. Pairwise distance of 76 genes from Glycine soja (new and old) and nine other Glycine species.
Fig 6
Fig 6. Distance between adjacent genes and junctions of the small single-copy (SSC), large single-copy (LSC), and two inverted repeat (IR) regions of the plastid genomes from ten Glycine species.
Boxes above and below the main line indicate the adjacent bordering genes. The figure is not to scale in regard to sequence length and only shows relative changes at or near the IR/SC borders.
Fig 7
Fig 7. Phylogenetic trees of ten Glycine species.
The whole-genome dataset was analysed using four different methods: neighbour-joining (NJ), maximum parsimony (MP), maximum likelihood (ML), and Bayesian inference (BI). Numbers above the branches represent bootstrap values in the NJ, MP, and ML trees and posterior probabilities in the BI trees. A red dot represents the position of G. soja (KY241814).

References

    1. Neuhaus HE, Emes MJ. NONPHOTOSYNTHETIC METABOLISM IN PLASTIDS. Annu Rev Plant Physiol Plant Mol Biol. 2000;51:111–40. Epub 2004/03/12. doi: 10.1146/annurev.arplant.51.1.111 . - DOI - PubMed
    1. Rodriguez-Ezpeleta N, Brinkmann H, Burey SC, Roure B, Burger G, Loffelhardt W, et al. Monophyly of primary photosynthetic eukaryotes: green plants, red algae, and glaucophytes. Curr Biol. 2005;15(14):1325–30. Epub 2005/07/30. doi: 10.1016/j.cub.2005.06.040 . - DOI - PubMed
    1. Palmer JD. Plastid chromosomes: structure and evolution1991.
    1. Henry RJ. Plant diversity and evolution: genotypic and phenotypic variation in higher plants: Cabi Publishing; 2005.
    1. Martin G, Baurens FC, Cardi C, Aury JM, D'Hont A. The Complete Chloroplast Genome of Banana (Musa acuminata, Zingiberales): Insight into Plastid Monocotyledon Evolution. Plos One. 2013;8(6). doi: 10.1371/journal.pone.0067350 - DOI - PMC - PubMed

Publication types

Substances

LinkOut - more resources