Low complexity regions in the proteins of prokaryotes perform important functional roles and are highly conserved
- PMID: 31504783
- PMCID: PMC6821194
- DOI: 10.1093/nar/gkz730
Low complexity regions in the proteins of prokaryotes perform important functional roles and are highly conserved
Abstract
We provide the first high-throughput analysis of the properties and functional role of Low Complexity Regions (LCRs) in more than 1500 prokaryotic and phage proteomes. We observe that, contrary to a widespread belief based on older and sparse data, LCRs actually have a significant, persistent and highly conserved presence and role in many and diverse prokaryotes. Their specific amino acid content is linked to proteins with certain molecular functions, such as the binding of RNA, DNA, metal-ions and polysaccharides. In addition, LCRs have been repeatedly identified in very ancient, and usually highly expressed proteins of the translation machinery. At last, based on the amino acid content enriched in certain categories, we have developed a neural network web server to identify LCRs and accurately predict whether they can bind nucleic acids, metal-ions or are involved in chaperone functions. An evaluation of the tool showed that it is highly accurate for eukaryotic proteins as well.
© The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.
Figures
Similar articles
-
Low-complexity regions in fungi display functional groups and are depleted in positively charged amino acids.NAR Genom Bioinform. 2025 Feb 27;7(1):lqaf014. doi: 10.1093/nargab/lqaf014. eCollection 2025 Mar. NAR Genom Bioinform. 2025. PMID: 40041205 Free PMC article.
-
Why do eukaryotic proteins contain more intrinsically disordered regions?PLoS Comput Biol. 2019 Jul 22;15(7):e1007186. doi: 10.1371/journal.pcbi.1007186. eCollection 2019 Jul. PLoS Comput Biol. 2019. PMID: 31329574 Free PMC article.
-
Novel conserved domains in proteins with predicted roles in eukaryotic cell-cycle regulation, decapping and RNA stability.BMC Genomics. 2004 Jul 16;5(1):45. doi: 10.1186/1471-2164-5-45. BMC Genomics. 2004. PMID: 15257761 Free PMC article.
-
Evolution of synapse complexity and diversity.Annu Rev Neurosci. 2012;35:111-31. doi: 10.1146/annurev-neuro-062111-150433. Annu Rev Neurosci. 2012. PMID: 22715880 Review.
-
Structure-function insights into prokaryotic and eukaryotic translation initiation.Curr Opin Struct Biol. 2009 Jun;19(3):300-9. doi: 10.1016/j.sbi.2009.04.010. Epub 2009 Jun 1. Curr Opin Struct Biol. 2009. PMID: 19493673 Review.
Cited by
-
Recent Advances in Archaeal Translation Initiation.Front Microbiol. 2020 Sep 18;11:584152. doi: 10.3389/fmicb.2020.584152. eCollection 2020. Front Microbiol. 2020. PMID: 33072057 Free PMC article. Review.
-
Regions with two amino acids in protein sequences: A step forward from homorepeats into the low complexity landscape.Comput Struct Biotechnol J. 2022 Sep 18;20:5516-5523. doi: 10.1016/j.csbj.2022.09.011. eCollection 2022. Comput Struct Biotechnol J. 2022. PMID: 36249567 Free PMC article.
-
The Remarkable Evolutionary Plasticity of Coronaviruses by Mutation and Recombination: Insights for the COVID-19 Pandemic and the Future Evolutionary Paths of SARS-CoV-2.Viruses. 2022 Jan 2;14(1):78. doi: 10.3390/v14010078. Viruses. 2022. PMID: 35062282 Free PMC article. Review.
-
RNA-Binding Proteins: The Key Modulator in Stress Granule Formation and Abiotic Stress Response.Front Plant Sci. 2022 Jun 15;13:882596. doi: 10.3389/fpls.2022.882596. eCollection 2022. Front Plant Sci. 2022. PMID: 35783947 Free PMC article. Review.
-
MIF-like domain containing protein orchestrates cellular differentiation and virulence in the fungal pathogen Magnaporthe oryzae.iScience. 2023 Aug 6;26(9):107565. doi: 10.1016/j.isci.2023.107565. eCollection 2023 Sep 15. iScience. 2023. PMID: 37664630 Free PMC article.
References
-
- Wootton J.C. Non-globular domains in protein sequences: automated segmentation using complexity measures. Comput. Chem. 1994; 18:269–285. - PubMed
-
- Wootton J.C., Drummond M.H.. The Q-linker: a class of interdomain sequences found in bacterial multidomain regulatory proteins. Protein. Eng. 1989; 2:535–543. - PubMed
-
- Huntley M.A., Golding G.B.. Simple sequences are rare in the Protein Data Bank. Proteins. 2002; 48:134–140. - PubMed
-
- Altschul S.F., Gish W., Miller W., Myers E.W., Lipman D.J.. Basic local alignment search tool. J. Mol. Biol. 1990; 215:403–410. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources