Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Oct 3:13:148.
doi: 10.1186/1471-2229-13-148.

The large soybean (Glycine max) WRKY TF family expanded by segmental duplication events and subsequent divergent selection among subgroups

Affiliations

The large soybean (Glycine max) WRKY TF family expanded by segmental duplication events and subsequent divergent selection among subgroups

Guangjun Yin et al. BMC Plant Biol. .

Abstract

Background: WRKY genes encode one of the most abundant groups of transcription factors in higher plants, and its members regulate important biological process such as growth, development, and responses to biotic and abiotic stresses. Although the soybean genome sequence has been published, functional studies on soybean genes still lag behind those of other species.

Results: We identified a total of 133 WRKY members in the soybean genome. According to structural features of their encoded proteins and to the phylogenetic tree, the soybean WRKY family could be classified into three groups (groups I, II, and III). A majority of WRKY genes (76.7%; 102 of 133) were segmentally duplicated and 13.5% (18 of 133) of the genes were tandemly duplicated. This pattern was not apparent in Arabidopsis or rice. The transcriptome atlas revealed notable differential expression in either transcript abundance or in expression patterns under normal growth conditions, which indicated wide functional divergence in this family. Furthermore, some critical amino acids were detected using DIVERGE v2.0 in specific comparisons, suggesting that these sites have contributed to functional divergence among groups or subgroups. In addition, site model and branch-site model analyses of positive Darwinian selection (PDS) showed that different selection regimes could have affected the evolution of these groups. Sites with high probabilities of having been under PDS were found in groups I, II c, II e, and III. Together, these results contribute to a detailed understanding of the molecular evolution of the WRKY gene family in soybean.

Conclusions: In this work, all the WRKY genes, which were generated mainly through segmental duplication, were identified in the soybean genome. Moreover, differential expression and functional divergence of the duplicated WRKY genes were two major features of this family throughout their evolutionary history. Positive selection analysis revealed that the different groups have different evolutionary rates. Together, these results contribute to a detailed understanding of the molecular evolution of the WRKY gene family in soybean.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Chromosome distribution of soybean (Glycine max) WRKY genes. The size of a chromosome is indicated by its relative length. Red outlined boxes represent segmentally duplicated genes. Tandem duplicated genes are indicated with vertical green lines. The location information and chromosome information were obtained from Phytozome. The figure was produced using the MapInspector program.
Figure 2
Figure 2
Alignment of multiple soybean WRKY genes and selectedAtWRKY domain amino acid sequences. Alignment was performed using Clustal W. The suffixes ‘N’ or ‘C’ denote the N-terminal and C-terminal WRKY domains from Group I WRKY proteins, respectively. The amino acids forming the zinc-finger motif are highlighted in yellow. The conserved WRKY amino acid signature is highlighted in blue. The four β-strands are shown in red. The position of a conserved intron is indicated by an arrowhead.
Figure 3
Figure 3
Phylogenetic tree of WRKY domains among soybean andArabidopsis. The amino acid sequences of the WRKY domain of soybean and Arabidopsis were aligned with Clustal W and the phylogenetic tree was constructed using the maximum likelihood method in MEGA 5.0. Group 1 proteins with the suffix ‘N’ or ‘C’ indicate the N-terminal WRKY domains or the C-terminal WRKY domains, respectively. Genes with similar functions clustered together are indicated by filled green circles. Gene expansion in soybean and Arabidopsis are indicated by coloring the subclade with the same color as the leaf label. The red arcs indicate different groups (or subgroups) of WRKY domains.
Figure 4
Figure 4
Expression profiles of 127 soybean WRKY genes. The hierarchical cluster color code: the largest values are displayed as the reddest (hot), the smallest values are displayed as the bluest (cool), and the intermediate values are a lighter color of either blue or red. Pearson correlation clustering was used to group the developmentally regulated genes. Six genes were excluded from the analysis due to no expression in an organ or a period.
Figure 5
Figure 5
Model building of the 3D structure of the soybean WRKY protein (Glyma13g00380) based on similarity to theAtWRKY4-C domain (Protein Data Bank (PDB) code: 2lexA). The ensemble of the selected structures in stereo view (A), (B), (C), and (D) positive selection sites detected by the branch-site model presented in group I, group II c, group II e, and group III, respectively. The sites with red color indicate amino acid residues under statistically significant (p < 0.05) positive selection, as calculated by Bayes Empirical Bayes estimation methods. The presented region is Asp247–Pro306, excluding the N-terminal region. The figure was produced using the Swiss-model and pyMOL programs.

References

    1. Zhang Y, Wang L. The WRKY transcription factor superfamily: its origin in eukaryotes and expansion in plants. BMC Evol Biol. 2005;5:1. doi: 10.1186/1471-2148-5-1. - DOI - PMC - PubMed
    1. Ishiguro N. Characterization of a cDNA encoding a novel DNA-binding protein, SPF1, that recognizes SP8 sequences in the 5’ upstream regions of genes coding for sporamin and β-amylase from sweet potato. Mol Gen Genet. 1994;244:563–571. - PubMed
    1. Rushton PJ, Macdonald H, Huttly AK, Lazarus CM, Hooley R. Members of a new family of DNA binding proteins bind to a conserved cis-element in the promoters of alpha-Amy2 genes. Plant Mol Biol. 1995;29:691–702. doi: 10.1007/BF00041160. - DOI - PubMed
    1. Rushton PJ, Torres JT, Parniske M, Wernert P, Hahlbrock K, Somssich IE. Interaction of elicitor-induced DNA-binding proteins with elicitor response elements in the promoters of parsley PR1 genes. EMBO J. 1996;15:5690–5700. - PMC - PubMed
    1. Cormack RS, Eulgem T, Rushton PJ, Köchner P, Hahlbrock K, Somssich IE. Leucine zipper containing WRKY proteins widen the spectrum of immediate early elicitor-induced WRKY transcription factors in parsley. Biochim Biophys Acta. 2002;1576:92–100. doi: 10.1016/S0167-4781(02)00298-1. - DOI - PubMed

Publication types

MeSH terms

LinkOut - more resources