Domain deletions and substitutions in the modular protein evolution
- PMID: 16640566
- DOI: 10.1111/j.1742-4658.2006.05220.x
Domain deletions and substitutions in the modular protein evolution
Abstract
The main mechanisms shaping the modular evolution of proteins are gene duplication, fusion and fission, recombination and loss of fragments. While a large body of research has focused on duplications and fusions, we concentrated, in this study, on how domains are lost. We investigated motif databases and introduced a measure of protein similarity that is based on domain arrangements. Proteins are represented as strings of domains and comparison was based on the classic dynamic alignment scheme. We found that domain losses and duplications were more frequent at the ends of proteins. We showed that losses can be explained by the introduction of start and stop codons which render the terminal domains nonfunctional, such that further shortening, until the whole domain is lost, is not evolutionarily selected against. We demonstrated that domains which also occur as single-domain proteins are less likely to be lost at the N terminus and in the middle, than at the C terminus. We conclude that fission/fusion events with single-domain proteins occur mostly at the C terminus. We found that domain substitutions are rare, in particular in the middle of proteins. We also showed that many cases of substitutions or losses result from erroneous annotations, but we were also able to find courses of evolutionary events where domains vanish over time. This is explained by a case study on the bacterial formate dehydrogenases.
Similar articles
-
Gene fusion/fission is a major contributor to evolution of multi-domain bacterial proteins.Bioinformatics. 2006 Jun 15;22(12):1418-23. doi: 10.1093/bioinformatics/btl135. Epub 2006 Apr 6. Bioinformatics. 2006. PMID: 16601004
-
Quantification of the elevated rate of domain rearrangements in metazoa.J Mol Biol. 2007 Oct 5;372(5):1337-48. doi: 10.1016/j.jmb.2007.06.022. Epub 2007 Jun 15. J Mol Biol. 2007. PMID: 17689563
-
Domain rearrangements in protein evolution.J Mol Biol. 2005 Nov 4;353(4):911-23. doi: 10.1016/j.jmb.2005.08.067. Epub 2005 Sep 21. J Mol Biol. 2005. PMID: 16198373
-
Arrangements in the modular evolution of proteins.Trends Biochem Sci. 2008 Sep;33(9):444-51. doi: 10.1016/j.tibs.2008.05.008. Epub 2008 Jul 24. Trends Biochem Sci. 2008. PMID: 18656364 Review.
-
How do new proteins arise?Curr Opin Struct Biol. 2010 Jun;20(3):390-6. doi: 10.1016/j.sbi.2010.02.005. Epub 2010 Mar 27. Curr Opin Struct Biol. 2010. PMID: 20347587 Review.
Cited by
-
Comparative analysis of the phytocyanin gene family in 10 plant species: a focus on Zea mays.Front Plant Sci. 2015 Jul 13;6:515. doi: 10.3389/fpls.2015.00515. eCollection 2015. Front Plant Sci. 2015. PMID: 26217366 Free PMC article.
-
Molecular and phylogenetic characterization of the sieve element occlusion gene family in Fabaceae and non-Fabaceae plants.BMC Plant Biol. 2010 Oct 8;10:219. doi: 10.1186/1471-2229-10-219. BMC Plant Biol. 2010. PMID: 20932300 Free PMC article.
-
The nature of protein domain evolution: shaping the interaction network.Curr Genomics. 2010 Aug;11(5):368-76. doi: 10.2174/138920210791616725. Curr Genomics. 2010. PMID: 21286315 Free PMC article.
-
Reassessing domain architecture evolution of metazoan proteins: major impact of gene prediction errors.Genes (Basel). 2011 Jul 13;2(3):449-501. doi: 10.3390/genes2030449. Genes (Basel). 2011. PMID: 24710207 Free PMC article.
-
Comparative Genome-Wide Analysis of the Malate Dehydrogenase Gene Families in Cotton.PLoS One. 2016 Nov 9;11(11):e0166341. doi: 10.1371/journal.pone.0166341. eCollection 2016. PLoS One. 2016. PMID: 27829020 Free PMC article.
MeSH terms
Substances
LinkOut - more resources
Full Text Sources