Comparative Study

. 2006 Aug 9:7:200.

doi: 10.1186/1471-2164-7-200.

Cross genome comparisons of serine proteases in Arabidopsis and rice

Lokesh P Tripathi¹, R Sowdhamini

Affiliations

PMID: 16895613
PMCID: PMC1560137
DOI: 10.1186/1471-2164-7-200

Comparative Study

Cross genome comparisons of serine proteases in Arabidopsis and rice

Lokesh P Tripathi et al. BMC Genomics. 2006.

. 2006 Aug 9:7:200.

doi: 10.1186/1471-2164-7-200.

Authors

Lokesh P Tripathi¹, R Sowdhamini

Affiliation

¹ National Centre for Biological Sciences, Tata Institute of Fundamental Research, GKVK Campus, Bellary Road, Bangalore 560 065, India. lokesh@ncbs.res.in

PMID: 16895613
PMCID: PMC1560137
DOI: 10.1186/1471-2164-7-200

Abstract

Background: Serine proteases are one of the largest groups of proteolytic enzymes found across all kingdoms of life and are associated with several essential physiological pathways. The availability of Arabidopsis thaliana and rice (Oryza sativa) genome sequences has permitted the identification and comparison of the repertoire of serine protease-like proteins in the two plant species.

Results: Despite the differences in genome sizes between Arabidopsis and rice, we identified a very similar number of serine protease-like proteins in the two plant species (206 and 222, respectively). Nearly 40% of the above sequences were identified as potential orthologues. Atypical members could be identified in the plant genomes for Deg, Clp, Lon, rhomboid proteases and species-specific members were observed for the highly populated subtilisin and serine carboxypeptidase families suggesting multiple lateral gene transfers. DegP proteases, prolyl oligopeptidases, Clp proteases and rhomboids share a significantly higher percentage orthology between the two genomes indicating substantial evolutionary divergence was set prior to speciation. Single domain architectures and paralogues for several putative subtilisins, serine carboxypeptidases and rhomboids suggest they may have been recruited for additional roles in secondary metabolism with spatial and temporal regulation. The analysis reveals some domain architectures unique to either or both of the plant species and some inactive proteases, like in rhomboids and Clp proteases, which could be involved in chaperone function.

Conclusion: The systematic analysis of the serine protease-like proteins in the two plant species has provided some insight into the possible functional associations of previously uncharacterised serine protease-like proteins. Further investigation of these aspects may prove beneficial in our understanding of similar processes in commercially significant crop plant species.

PubMed Disclaimer

Figures

**Figure 1**
Unrooted N-J tree computed from multiple sequence alignments of *Arabidopsis* (red) and rice (blue) subtilisin domains. Subtilisin-like protease domains were aligned using ClustalW [95] program and the alignments were exported to Phylip package [96] for representing the Neighbor-Joining tree (see methods). The colors and circles represent different evolutionary clades identified in the analysis (see text for details). Clade I is represented in purple, Clade II is shaded orange, Clade III in green, Clade IV in brown and Clade V in yellow. For clarity, bootstrap values were replaced with symbols representing bootstrap percentages >50%. Bootstrap values between 50–60% are represented by an asterix, circles represent bootstrap values from 60%–80% while bootstrap values >80% are represented by rectangles. Gene names correspond to those in Additional files 1 and 2. For brevity, rice gene names have been shortened to OsXXg##### instead of LOC_OsXXg#####, XX referring to chromosome 1–12 and a 5 digit number assigned to each gene. A few species specific gene clusters were also identified in the analysis (see text for details).

**Figure 2**
Domain Architectures identified in *Arabidopsis* and rice serine Protease-like proteins. At -*Arabidopsis thaliana*; Os – Rice (*Oryza sativa*); B- Bacteria; A- Archaea; E- Eukaryota; Sxx- Serine protease family Sxx domain, where Sxx refers to the serine protease family as per MEROPS [5] classification (see text for details). PDZ- PDZ domain (Pfam [37] accession: PF00595); PA- Protease associated domain (Pfam [37] accession: PF02225); SUB N- Subtilisin N-terminal region (Pfam [37] accession: PF005922); DUF1034- Domain of unknown function (Pfam [37] accession: PF06280); Arf- ADP-ribosylation factor family (Pfam [37] accession: PF00025); C2- C2 domain (Pfam [37] accession: PF00168); zf-CCHC- Zinc knuckle (Pfam [37] accession: PF00098); rve- Integrase core domain (Pfam accession:PF00665); Extensin 2- Extensin-like region (Pfam [37] accession: PF04554); S9 N- Prolyl oligopeptidase, N-terminal beta-propeller domain (Pfam [37] accession: PF02897); PD40- WD40-like beta propeller repeat (Pfam [37] accession: PF07676); DPPIV N- Dipeptidyl peptidase (DPP IV) N-terminal region (Pfam [37] accession: PF00930); Transposase 21- Transposase family tnp2 (Pfam [37] accession: PF02992); Retrotrans gag- Retrotransposon gag protein (Pfam [37] accession: PF03732); ABC1- ABC1 family (Pfam [37] accession: PF03109); LON- ATP-dependent protease La (LON) domain (Pfam [37] accession: PF02190); AAA- ATPase family associated with various cellular activities (Pfam [37] accession: PF00004); UBA- UBA/TN-S (ubiquitin associated) domain (Pfam [37] accession: PF000627); zf-RanBP- Zinc finger in Ran binding protein and others (Pfam [37] accession: PF00641).

**Figure 3**
Unrooted N-J tree computed from multiple sequence alignments of *Arabidopsis* (red) and rice (blue) prolyl oligopeptidase domains. Prolyl oligopeptidase-like domains were aligned using ClustalW [95] program and the alignments were exported to Phylip package [96] for representing the Neighbor-Joining tree (see methods). The colors and circles represent the two evolutionary clades identified in the analysis (see text for details). Clade I is represented in Orange, Clade II is shaded green. For clarity, bootstrap values were replaced with symbols representing bootstrap percentages >50%. Bootstrap values between 50–60% are represented by an asterix, circles represent bootstrap values from 60%–80% while bootstrap values >80% are represented by rectangles. Gene names correspond to those in Additional files 1 and 2. For brevity, rice gene names have been shortened to OsXXg##### instead of LOC_OsXXg#####, XX referring to chromosome 1–12 and a 5 digit number assigned to each gene. Subfamily assignments where possible are indicated in parentheses below the gene name (see Figure SF3 and text for details).

**Figure 4**
Multiple sequence alignment of the Clp protease domain region of the annotated *Arabidopsis* and rice Clp protease-like proteins. The catalytic triad residues are indicated. Gene names correspond to those in Additional files 1 and 2. For brevity, rice gene names have been shortened to OsXXg##### instead of LOC_OsXXg#####, XX referring to chromosome 1–12 and a 5 digit number assigned to each gene. Several gene products that display mutation in one or more catalytic triad residues can be visualised here (see text for details).

**Figure 5**
Unrooted N-J tree computed from multiple sequence alignments of *Arabidopsis* (red) and rice (blue) Clp protease domains. Clp protease-like domains were aligned using ClustalW [95] program and the alignments were exported to Phylip package [96] for representing the Neighbor-Joining tree (see methods). The colors and circles represent different evolutionary clades identified in the analysis (see text for details). Clade I is represented in orange, while clades II-VIII are shaded in black. For clarity, bootstrap values were replaced with symbols representing bootstrap percentages >50%. Bootstrap values between 50–60% are represented by an asterix, circles represent bootstrap values from 60%–80% while bootstrap values >80% are represented by rectangles. Gene names correspond to those in Additional files 1 and 2. For brevity, rice gene names have been shortened to OsXXg##### instead of LOC_OsXXg#####, XX referring to chromosome 1–12 and a 5 digit number assigned to each gene.

**Figure 6**
Multiple sequence alignment of the Type I Spase domain region of the annotated *Arabidopsis* and rice Type I Spase-like proteins. The catalytic dyad residues are indicated. Gene names correspond to those in Additional files 1 and 2. For brevity, rice gene names have been shortened to OsXXg##### instead of LOC_OsXXg#####, XX referring to chromosome 1–12 and a 5 digit number assigned to each gene. The variations in the second residue (K/H) of catalytic dyad can be identified here (see text for details).

**Figure 7**
Unrooted N-J tree computed from multiple sequence alignments of *Arabidopsis* (red) and rice (blue) family S28 protease domains. S28-like protease domains were aligned using ClustalW [95] program and the alignments were exported to Phylip package [96] for representing the Neighbor-Joining tree (see methods). The colors represent the two evolutionary clades identified in the analysis (see text for details). Clade I is represented in Orange, Clade II is shaded green. For clarity, bootstrap values were replaced with symbols representing bootstrap percentages >50%. Bootstrap values between 50–60% are represented by an asterix, circles represent bootstrap values from 60%–80% while bootstrap values >80% are represented by rectangles. Gene names correspond to those in Additional files 1 and 2. For brevity, rice gene names have been shortened to OsXXg##### instead of LOC_OsXXg#####, XX referring to chromosome 1–12 and a 5 digit number assigned to each gene.

**Figure 8**
Unrooted N-J tree computed from multiple sequence alignments of *Arabidopsis* (red) and rice (blue) rhomboid protease domains. Rhomboid protease-like domains were aligned using ClustalW [95] program and the alignments were exported to Phylip package [96] for representing the Neighbor-Joining tree (see methods). The colors and circles represent different evolutionary clades identified in the analysis (see text for details). Clade I is represented in orange, Clade II is shaded brown, Clade III in green, Clade IV in purple. Clades V-VIII are shaded in black. For clarity, bootstrap values were replaced with symbols representing bootstrap percentages >50%. Bootstrap values between 50–60% are represented by an asterix, circles represent bootstrap values from 60%–80% while bootstrap values >80% are represented by rectangles. Gene names correspond to those in Additional files 1 and 2. For brevity, rice gene names have been shortened to OsXXg##### instead of LOC_OsXXg#####, XX referring to chromosome 1–12 and a 5 digit number assigned to each gene.

See this image and copyright information in PMC

Cited by

Genome-wide survey of prokaryotic serine proteases: analysis of distribution and domain architectures of five serine protease families in prokaryotes.
Tripathi LP, Sowdhamini R. Tripathi LP, et al. BMC Genomics. 2008 Nov 19;9:549. doi: 10.1186/1471-2164-9-549. BMC Genomics. 2008. PMID: 19019219 Free PMC article.
Genome-Wide Investigation and Co-Expression Network Analysis of SBT Family Gene in Gossypium.
Xue T, Liu L, Zhang X, Li Z, Sheng M, Ge X, Xu W, Su Z. Xue T, et al. Int J Mol Sci. 2023 Mar 17;24(6):5760. doi: 10.3390/ijms24065760. Int J Mol Sci. 2023. PMID: 36982835 Free PMC article.
Horizontal transfer of a subtilisin gene from plants into an ancestor of the plant pathogenic fungal genus Colletotrichum.
Armijos Jaramillo VD, Vargas WA, Sukno SA, Thon MR. Armijos Jaramillo VD, et al. PLoS One. 2013;8(3):e59078. doi: 10.1371/journal.pone.0059078. Epub 2013 Mar 15. PLoS One. 2013. PMID: 23554975 Free PMC article.
The family of Deg/HtrA proteases in plants.
Schuhmann H, Huesgen PF, Adamska I. Schuhmann H, et al. BMC Plant Biol. 2012 Apr 20;12:52. doi: 10.1186/1471-2229-12-52. BMC Plant Biol. 2012. PMID: 22520048 Free PMC article.
BRS1 mediates plant redox regulation and cold responses.
Zhang D, Zhao Y, Wang J, Zhao P, Xu S. Zhang D, et al. BMC Plant Biol. 2021 Jun 11;21(1):268. doi: 10.1186/s12870-021-03045-y. BMC Plant Biol. 2021. PMID: 34116634 Free PMC article.

See all "Cited by" articles

References

1. Callis J. Regulation of Protein Degradation. Plant Cell. 1995;7:845–857. doi: 10.1105/tpc.7.7.845. - DOI - PMC - PubMed
1. Schaller A. A cut above the rest: the regulatory function of plant proteases. Planta. 2004;220:183–197. doi: 10.1007/s00425-004-1407-2. - DOI - PubMed
1. Barrett AJ, Rawlings ND. Families and clans of serine peptidases. Arch Biochem Biophys. 1995;318:247–250. doi: 10.1006/abbi.1995.1227. - DOI - PubMed
1. Rawlings ND, Barrett AJ. Dipeptidyl-peptidase II is related to lysosomal Pro-X carboxypeptidase. Biochim Biophys Acta. 1996;1298:1–3. - PubMed
1. Rawlings ND, Morton FR, Barrett AJ. MEROPS: the peptidase database. Nucleic Acids Res. 2006;34:D270–2. doi: 10.1093/nar/gkj089. - DOI - PMC - PubMed

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

WT_/Wellcome Trust/United Kingdom

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database
Molecular Biology Databases
- GlyGen glycoinformatics resource
- The Arabidopsis Information Resource
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Cross genome comparisons of serine proteases in Arabidopsis and rice

Affiliation

Cross genome comparisons of serine proteases in Arabidopsis and rice

Authors

Affiliation

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources

Molecular Biology Databases

Miscellaneous

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

Related information

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources

Molecular Biology Databases

Miscellaneous