Homepeptide repeats: implications for protein structure, function and evolution
- PMID: 23084777
- PMCID: PMC5054710
- DOI: 10.1016/j.gpb.2012.04.001
Homepeptide repeats: implications for protein structure, function and evolution
Abstract
Analysis of protein sequences from Mycobacterium tuberculosis H37Rv (Mtb H37Rv) was performed to identify homopeptide repeat-containing proteins (HRCPs). Functional annotation of the HRCPs showed that they are preferentially involved in cellular metabolism. Furthermore, these homopeptide repeats might play some specific roles in protein-protein interaction. Repeat length differences among Bacteria, Archaea and Eukaryotes were calculated in order to identify the conservation of the repeats in these divergent kingdoms. From the results, it was evident that these repeats have a higher degree of conservation in Bacteria and Archaea than in Eukaryotes. In addition, there seems to be a direct correlation between the repeat length difference and the degree of divergence between the species. Our study supports the hypothesis that the presence of homopeptide repeats influences the rate of evolution of the protein sequences in which they are embedded. Thus, homopeptide repeat may have structural, functional and evolutionary implications on proteins.
Copyright © 2012. Published by Elsevier Ltd.
Figures






Similar articles
-
Effect of single amino acid mutations on function of Mycobacterium tuberculosis H37RV and H37RA by computational approaches.Indian J Tuberc. 2014 Jul;61(3):200-6. Indian J Tuberc. 2014. PMID: 25241568
-
Sequence analysis corresponding to the PPE and PE proteins in Mycobacterium tuberculosis and other genomes.J Biosci. 2003 Mar;28(2):169-79. doi: 10.1007/BF02706216. J Biosci. 2003. PMID: 12711809
-
Protein length in eukaryotic and prokaryotic proteomes.Nucleic Acids Res. 2005 Jun 10;33(10):3390-400. doi: 10.1093/nar/gki615. Print 2005. Nucleic Acids Res. 2005. PMID: 15951512 Free PMC article.
-
Mycobacterium tuberculosis: a model system for structural genomics.Curr Opin Struct Biol. 2003 Dec;13(6):658-64. doi: 10.1016/j.sbi.2003.10.004. Curr Opin Struct Biol. 2003. PMID: 14675542 Review.
-
Molecular evolution before the origin of species.Prog Biophys Mol Biol. 2002 May-Jul;79(1-3):77-133. doi: 10.1016/s0079-6107(02)00012-3. Prog Biophys Mol Biol. 2002. PMID: 12225777 Review.
Cited by
-
Search and analysis of identical reverse octapeptides in unrelated proteins.Genomics Proteomics Bioinformatics. 2013 Apr;11(2):114-21. doi: 10.1016/j.gpb.2012.11.005. Epub 2013 Mar 21. Genomics Proteomics Bioinformatics. 2013. PMID: 23523652 Free PMC article.
-
PPS: A computing engine to find Palindromes in all Protein sequences.Bioinformation. 2014 Jan 29;10(1):48-51. doi: 10.6026/97320630010048. eCollection 2014. Bioinformation. 2014. PMID: 24516327 Free PMC article.
-
The relationship between protein domains and homopeptides in the Plasmodium falciparum proteome.PeerJ. 2020 Oct 2;8:e9940. doi: 10.7717/peerj.9940. eCollection 2020. PeerJ. 2020. PMID: 33062426 Free PMC article.
-
Identification and Analysis of Long Repeats of Proteins at the Domain Level.Front Bioeng Biotechnol. 2019 Oct 8;7:250. doi: 10.3389/fbioe.2019.00250. eCollection 2019. Front Bioeng Biotechnol. 2019. PMID: 31649924 Free PMC article.
-
Amino acid repeats avert mRNA folding through conservative substitutions and synonymous codons, regardless of codon bias.Heliyon. 2017 Dec 28;3(12):e00492. doi: 10.1016/j.heliyon.2017.e00492. eCollection 2017 Dec. Heliyon. 2017. PMID: 29387823 Free PMC article.
References
-
- Nakachi Y., Hayakawa T., Oota H., Sumiyama K., Wang L., Ueda S. Nucleotide compositional constraints on genomes generate alanine-, glcyine-, and proline-rich structures in transcription factors. Mol Biol Evol. 1997;14:1042–1049. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Molecular Biology Databases