TAPO: A combined method for the identification of tandem repeats in protein structures
- PMID: 26320412
- DOI: 10.1016/j.febslet.2015.08.025
TAPO: A combined method for the identification of tandem repeats in protein structures
Abstract
In recent years, there has been an emergence of new 3D structures of proteins containing tandem repeats (TRs), as a result of improved expression and crystallization strategies. Databases focused on structure classifications (PDB, SCOP, CATH) do not provide an easy solution for selection of these structures from PDB. Several approaches have been developed, but no best approach exists to identify the whole range of 3D TRs. Here we describe the TAndem PrOtein detector (TAPO) that uses periodicities of atomic coordinates and other types of structural representation, including strings generated by conformational alphabets, residue contact maps, and arrangements of vectors of secondary structure elements. The benchmarking shows the superior performance of TAPO over the existing programs. In accordance with our analysis of PDB using TAPO, 19% of proteins contain 3D TRs. This analysis allowed us to identify new families of 3D TRs, suggesting that TAPO can be used to regularly update the collection and classification of existing repetitive structures.
Keywords: 3D protein structure; Non-globular protein; Prediction of repetitive unit; Prediction pipeline; Proteome; Tandem repeat; Webserver.
Copyright © 2015 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
Similar articles
-
In search of the boundary between repetitive and non-repetitive protein sequences.Biochem Soc Trans. 2015 Oct;43(5):807-11. doi: 10.1042/BST20150073. Biochem Soc Trans. 2015. PMID: 26517886 Review.
-
Tally: a scoring tool for boundary determination between repetitive and non-repetitive protein sequences.Bioinformatics. 2016 Jul 1;32(13):1952-8. doi: 10.1093/bioinformatics/btw118. Epub 2016 Mar 7. Bioinformatics. 2016. PMID: 27153701
-
Tandem repeats in proteins: from sequence to structure.J Struct Biol. 2012 Sep;179(3):279-88. doi: 10.1016/j.jsb.2011.08.009. Epub 2011 Aug 24. J Struct Biol. 2012. PMID: 21884799
-
T-REKS: identification of Tandem REpeats in sequences with a K-meanS based algorithm.Bioinformatics. 2009 Oct 15;25(20):2632-8. doi: 10.1093/bioinformatics/btp482. Epub 2009 Aug 11. Bioinformatics. 2009. PMID: 19671691
-
Comparison of protein repeat classifications based on structure and sequence families.Biochem Soc Trans. 2015 Oct;43(5):832-7. doi: 10.1042/BST20150079. Biochem Soc Trans. 2015. PMID: 26517890 Review.
Cited by
-
STRPsearch: fast detection of structured tandem repeat proteins.Bioinformatics. 2024 Nov 28;40(12):btae690. doi: 10.1093/bioinformatics/btae690. Bioinformatics. 2024. PMID: 39558588 Free PMC article.
-
Structured Tandem Repeats in Protein Interactions.Int J Mol Sci. 2024 Mar 5;25(5):2994. doi: 10.3390/ijms25052994. Int J Mol Sci. 2024. PMID: 38474241 Free PMC article.
-
RepeatsDB 2.0: improved annotation, classification, search and visualization of repeat protein structures.Nucleic Acids Res. 2017 Jan 4;45(D1):D308-D312. doi: 10.1093/nar/gkw1136. Epub 2016 Nov 29. Nucleic Acids Res. 2017. PMID: 27899671 Free PMC article.
-
Side chain flexibility and the symmetry of protein homodimers.PLoS One. 2020 Jul 24;15(7):e0235863. doi: 10.1371/journal.pone.0235863. eCollection 2020. PLoS One. 2020. PMID: 32706779 Free PMC article.
-
MemSTATS: A Benchmark Set of Membrane Protein Symmetries and Pseudosymmetries.J Mol Biol. 2020 Jan 17;432(2):597-604. doi: 10.1016/j.jmb.2019.09.020. Epub 2019 Oct 16. J Mol Biol. 2020. PMID: 31628944 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources