Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2010 Aug;22(8):2545-61.
doi: 10.1105/tpc.110.074229. Epub 2010 Aug 20.

Structural and functional divergence of a 1-Mb duplicated region in the soybean (Glycine max) genome and comparison to an orthologous region from Phaseolus vulgaris

Affiliations
Comparative Study

Structural and functional divergence of a 1-Mb duplicated region in the soybean (Glycine max) genome and comparison to an orthologous region from Phaseolus vulgaris

Jer-Young Lin et al. Plant Cell. 2010 Aug.

Abstract

Soybean (Glycine max) has undergone at least two rounds of polyploidization, resulting in a paleopolyploid genome that is a mosaic of homoeologous regions. To determine the structural and functional impact of these duplications, we sequenced two ~1-Mb homoeologous regions of soybean, Gm8 and Gm15, derived from the most recent ~13 million year duplication event and the orthologous region from common bean (Phaseolus vulgaris), Pv5. We observed inversions leading to major structural variation and a bias between the two chromosome segments as Gm15 experienced more gene movement (gene retention rate of 81% in Gm15 versus 91% in Gm8) and a nearly twofold increase in the deletion of long terminal repeat (LTR) retrotransposons via solo LTR formation. Functional analyses of Gm15 and Gm8 revealed decreases in gene expression and synonymous substitution rates for Gm15, for instance, a 38% increase in transcript levels from Gm8 relative to Gm15. Transcriptional divergence of homoeologs was found based on expression patterns among seven tissues and developmental stages. Our results indicate asymmetric evolution between homoeologous regions of soybean as evidenced by structural changes and expression variances of homoeologous genes.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Annotation Results of Chromosome Segments from Soybean (Gm8 and Gm15) and Phaseolus (Pv5). The two soybean homoeologous regions, Gm8 and Gm15, and orthologous Phaseolus region, Pv5, are shown. Black bars are sequence contigs. Gm8 has two physical gaps, and Pv5 has one. Pentagons represent genes. Orange genes have homoeologs and/or orthologs (collinear genes), connected by black lines. Yellow genes do not have homoeologs and orthologs (noncollinear genes). Triangles represent transposons: blue triangles are intact LTR retrotranspsons, pink triangles are solo LTRs, cyan triangles are truncated LTR retrotransposons, and green triangles are non-LTR retrotransposons. Shaded boxes show the duplication events: green boxes show segmental duplications. The purple box shows a tandem duplication. Black numbers are the gene number from annotation results. Insertion times of intact LTR retrotransposons are shown in red. Regions between the two pink dotted lines are physical gaps in Gm8 and their corresponding intervals in Gm15.
Figure 2.
Figure 2.
Distribution of Ks Values of Genes in Both Soybean Homoeologs and Phaseolus. Blue circles are Ks values from Pv5-Gm8; green triangles are Ks values from Pv5-Gm15; red boxes are Ks values from Gm8-Gm15. Different zones show the Ks from different homoeologous and orthologous counterparts. Zone I shows the Ks from the pairwise comparison of the two soybean homoeologs and one Phaseolus ortholog. Zone II shows the Ks from the sequence comparisons of Pv5 orthologs and Gm8 homoeologs. Zone III shows the Ks from Pv5 orthologs and Gm15 homoeolog comparisons. Zone IV shows the Ks from homoeologs between Gm8 and Gm15. Numbers below the plot are the annotated gene numbers. Ks value of gene114 (Pv5-Gm15) is ~2.8 due to extensive variation outside the conserved protein domain, and it is not shown here.
Figure 3.
Figure 3.
Heat Maps of Relative Transcript Abundance of Homoeologous Genes on Gm8 and Gm15 for Seven Different Tissue Types. The relative positions of 29 tested genes (top to bottom) are shown as colored rectangles along the Gm8 and Gm15 contigs. The numbers on the left are gene annotation results of the 29 tested genes. The transcription level of each gene is indicated as high (red) or low (blue) relative to the respective homoeologous counterpart. Homoeologs with approximately equal transcription levels are shown as black. The scale on the right indicates the coloration used to depict relative transcript proportions between homoeologous copies.
Figure 4.
Figure 4.
Assessment of Tissue-Specific Transcriptional Divergence between Homoeolog Pairs. The relative transcriptional proportions of the Gm8 homoeologs are shown for 29 tested gene pairs. The mean Gm8 proportion across two biological replicates is shown in the plot for each gene pair across seven tissue types. The percentage of statistically significant (P < 0.05) tissue × tissue differences detected in a factorial pairwise comparison of the homoeolog proportions among seven tissue types is shown below each gene pair number. There was a wide range (2 to 67%) of transcription variations among homoeolog pairs. The genes are ordered from lowest to highest percentage of tissue × tissue differences among the seven tissue types. Therefore, the gene pairs on the left infrequently exhibited significant tissue-specific transcriptional variation, and the genes on the right frequently exhibited significant tissue-specific transcriptional variation. The red number shown below each percentage indicates the number of informative assays for each homoeolog pair.

Similar articles

Cited by

References

    1. Altschul S.F., Gish W., Miller W., Myers E.W., Lipman D.J. (1990). Basic local alignment search tool. J. Mol. Biol. 215: 403–410 - PubMed
    1. Ammiraju J.S., et al. (2008). Dynamic evolution of oryza genomes is revealed by comparative genomic analysis of a genus-wide vertical data set. Plant Cell 20: 3191–3209 - PMC - PubMed
    1. Arndt P.F., Hwa T., Petrov D.A. (2005). Substantial regional variation in substitution rates in the human genome: Importance of GC content, gene density, and telomere-specific effects. J. Mol. Evol. 60: 748–763 - PubMed
    1. Baer C.F., Miyamoto M.M., Denver D.R. (2007). Mutation rate variation in multicellular eukaryotes: Causes and consequences. Nat. Rev. Genet. 8: 619–631 - PubMed
    1. Benovoy D., Drouin G. (2006). Processed pseudogenes, processed genes, and spontaneous mutations in the Arabidopsis genome. J. Mol. Evol. 62: 511–522 - PubMed

Publication types

LinkOut - more resources