Complex evolutionary footprints revealed in an analysis of reused protein segments of diverse lengths
- PMID: 29078314
- PMCID: PMC5676897
- DOI: 10.1073/pnas.1707642114
Complex evolutionary footprints revealed in an analysis of reused protein segments of diverse lengths
Erratum in
-
Correction for Nepomnyachiy et al., Complex evolutionary footprints revealed in an analysis of reused protein segments of diverse lengths.Proc Natl Acad Sci U S A. 2018 Jun 5;115(23):E5430. doi: 10.1073/pnas.1807785115. Epub 2018 May 29. Proc Natl Acad Sci U S A. 2018. PMID: 29844173 Free PMC article. No abstract available.
Abstract
Proteins share similar segments with one another. Such "reused parts"-which have been successfully incorporated into other proteins-are likely to offer an evolutionary advantage over de novo evolved segments, as most of the latter will not even have the capacity to fold. To systematically explore the evolutionary traces of segment "reuse" across proteins, we developed an automated methodology that identifies reused segments from protein alignments. We search for "themes"-segments of at least 35 residues of similar sequence and structure-reused within representative sets of 15,016 domains [Evolutionary Classification of Protein Domains (ECOD) database] or 20,398 chains [Protein Data Bank (PDB)]. We observe that theme reuse is highly prevalent and that reuse is more extensive when the length threshold for identifying a theme is lower. Structural domains, the best characterized form of reuse in proteins, are just one of many complex and intertwined evolutionary traces. Others include long themes shared among a few proteins, which encompass and overlap with shorter themes that recur in numerous proteins. The observed complexity is consistent with evolution by duplication and divergence, and some of the themes might include descendants of ancestral segments. The observed recursive footprints, where the same amino acid can simultaneously participate in several intertwined themes, could be a useful concept for protein design. Data are available at http://trachel-srv.cs.haifa.ac.il/rachel/ppi/themes/.
Keywords: ancestral segments; protein evolutionary patterns; protein function annotation; protein space.
Copyright © 2017 the Author(s). Published by PNAS.
Conflict of interest statement
The authors declare no conflict of interest.
Figures




Similar articles
-
Similar protein segments shared between domains of different evolutionary lineages.Protein Sci. 2022 Sep;31(9):e4407. doi: 10.1002/pro.4407. Protein Sci. 2022. PMID: 36040261 Free PMC article.
-
ECOD: new developments in the evolutionary classification of domains.Nucleic Acids Res. 2017 Jan 4;45(D1):D296-D302. doi: 10.1093/nar/gkw1137. Epub 2016 Nov 29. Nucleic Acids Res. 2017. PMID: 27899594 Free PMC article.
-
Navigating Among Known Structures in Protein Space.Methods Mol Biol. 2019;1851:233-249. doi: 10.1007/978-1-4939-8736-8_12. Methods Mol Biol. 2019. PMID: 30298400
-
Searching protein space for ancient sub-domain segments.Curr Opin Struct Biol. 2021 Jun;68:105-112. doi: 10.1016/j.sbi.2020.11.006. Epub 2021 Jan 18. Curr Opin Struct Biol. 2021. PMID: 33476896 Review.
-
Classification of proteins with shared motifs and internal repeats in the ECOD database.Protein Sci. 2016 Jul;25(7):1188-203. doi: 10.1002/pro.2893. Epub 2016 Feb 21. Protein Sci. 2016. PMID: 26833690 Free PMC article. Review.
Cited by
-
Construction of a Deep Neural Network Energy Function for Protein Physics.J Chem Theory Comput. 2022 Sep 13;18(9):5649-5658. doi: 10.1021/acs.jctc.2c00069. Epub 2022 Aug 8. J Chem Theory Comput. 2022. PMID: 35939398 Free PMC article.
-
On Protein Loops, Prior Molecular States and Common Ancestors of Life.J Mol Evol. 2024 Oct;92(5):624-646. doi: 10.1007/s00239-024-10167-y. Epub 2024 Apr 23. J Mol Evol. 2024. PMID: 38652291 Free PMC article. Review.
-
Reused Protein Segments Linked to Functional Dynamics.Mol Biol Evol. 2024 Sep 4;41(9):msae184. doi: 10.1093/molbev/msae184. Mol Biol Evol. 2024. PMID: 39226145 Free PMC article.
-
Mechanisms of Cotranslational Protein Maturation in Bacteria.Front Mol Biosci. 2021 May 25;8:689755. doi: 10.3389/fmolb.2021.689755. eCollection 2021. Front Mol Biosci. 2021. PMID: 34113653 Free PMC article. Review.
-
Nearest neighbor search on embeddings rapidly identifies distant protein relations.Front Bioinform. 2022 Nov 17;2:1033775. doi: 10.3389/fbinf.2022.1033775. eCollection 2022. Front Bioinform. 2022. PMID: 36466147 Free PMC article.
References
-
- Lupas AN, Ponting CP, Russell RB. On the evolution of protein folds: Are similar motifs in different protein folds the result of convergence, insertion, or relics of an ancient peptide world? J Struct Biol. 2001;134:191–203. - PubMed
-
- Söding J, Lupas AN. More than the sum of their parts: On the evolution of proteins from peptides. Bioessays. 2003;25:837–846. - PubMed
-
- Vogel C, Bashton M, Kerrison ND, Chothia C, Teichmann SA. Structure, function and evolution of multidomain proteins. Curr Opin Struct Biol. 2004;14:208–216. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources