Occurrence of disordered patterns and homorepeats in eukaryotic and bacterial proteomes
- PMID: 22009164
- DOI: 10.1039/c1mb05318c
Occurrence of disordered patterns and homorepeats in eukaryotic and bacterial proteomes
Abstract
Combining the motif discovery and disorder protein segment identification in PDB allows us to create the first and largest library of disordered patterns. At present the library includes 109 disordered patterns. Here we offer a comprehensive analysis of the occurrence of selected disordered patterns and 20 homorepeats of 6 residues long in 123 proteomes. 27 disordered patterns occur sparsely in all considered proteomes, but the patterns of low-complexity-homorepeats-appear more often in eukaryotic than in bacterial proteomes. A comparative analysis of the number of proteins containing homorepeats of 6 residues long and the disordered selected patterns in these proteomes has been performed. The matrices of correlation coefficients between numbers of proteins where at least once a homorepeat of six residues long for each of 20 types of amino acid residues and 109 disordered patterns from the library appears in 9 kingdoms of eukaryota and 5 phyla of bacteria have been calculated. As a rule, the correlation coefficients are higher inside the considered kingdom than between them. The largest fraction of homorepeats of 6 residues belongs to Amoebozoa proteomes (D. discoideum), 46%. Moreover, the longest uninterrupted repeats belong to S306 from D. discoideum (Amoebozoa). Homorepeats of some amino acids occur more frequently than others and the type of homorepeats varies across different proteomes, . For example, E6 appears most frequent for all considered proteomes for Chordata, Q6 for Arthropoda, S6 for Nematoda. The averaged occurrence of multiple long runs of 6 amino acids in a decreasing order for 97 eukaryotic proteomes is as follows: Q6, S6, A6, G6, N6, E6, P6, T6, D6, K6, L6, H6, R6, F6, V6, I6, Y6, C6, M6, W6, and for 26 bacterial proteomes it is A6, G6, P6, and the others occur seldom. This suggests that such short similar motifs are responsible for common functions for nonhomologous, unrelated proteins from different organisms.
Similar articles
-
HRaP: database of occurrence of HomoRepeats and patterns in proteomes.Nucleic Acids Res. 2014 Jan;42(Database issue):D273-8. doi: 10.1093/nar/gkt927. Epub 2013 Oct 22. Nucleic Acids Res. 2014. PMID: 24150944 Free PMC article.
-
[Occurrence of motifs with six amino acid residues in three eukaryotic proteomes].Mol Biol (Mosk). 2012 Jan-Feb;46(1):184-90. Mol Biol (Mosk). 2012. PMID: 22642117 Russian.
-
Disordered patterns in clustered Protein Data Bank and in eukaryotic and bacterial proteomes.PLoS One. 2011;6(11):e27142. doi: 10.1371/journal.pone.0027142. Epub 2011 Nov 4. PLoS One. 2011. PMID: 22073276 Free PMC article.
-
Structure-function relationships in protein homorepeats.Curr Opin Struct Biol. 2023 Dec;83:102726. doi: 10.1016/j.sbi.2023.102726. Epub 2023 Nov 2. Curr Opin Struct Biol. 2023. PMID: 37924569 Review.
-
Protein homorepeats sequences, structures, evolution, and functions.Adv Protein Chem Struct Biol. 2010;79:59-88. doi: 10.1016/S1876-1623(10)79002-7. Adv Protein Chem Struct Biol. 2010. PMID: 20621281 Review.
Cited by
-
Predicting the involvement of polyQ- and polyA in protein-protein interactions by their amino acid context.Heliyon. 2024 Sep 14;10(18):e37861. doi: 10.1016/j.heliyon.2024.e37861. eCollection 2024 Sep 30. Heliyon. 2024. PMID: 39323775 Free PMC article.
-
HRaP: database of occurrence of HomoRepeats and patterns in proteomes.Nucleic Acids Res. 2014 Jan;42(Database issue):D273-8. doi: 10.1093/nar/gkt927. Epub 2013 Oct 22. Nucleic Acids Res. 2014. PMID: 24150944 Free PMC article.
-
Proteome-scale relationships between local amino acid composition and protein fates and functions.PLoS Comput Biol. 2018 Sep 24;14(9):e1006256. doi: 10.1371/journal.pcbi.1006256. eCollection 2018 Sep. PLoS Comput Biol. 2018. PMID: 30248088 Free PMC article.
-
Quarterly intrinsic disorder digest (January-February-March, 2014).Intrinsically Disord Proteins. 2016 Feb 12;4(1):e1153395. doi: 10.1080/21690707.2016.1153395. eCollection 2016. Intrinsically Disord Proteins. 2016. PMID: 28232896 Free PMC article.
-
Reviewing the Structure-Function Paradigm in Polyglutamine Disorders: A Synergistic Perspective on Theoretical and Experimental Approaches.Int J Mol Sci. 2024 Jun 20;25(12):6789. doi: 10.3390/ijms25126789. Int J Mol Sci. 2024. PMID: 38928495 Free PMC article. Review.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Research Materials
Miscellaneous