A bioinformatics pipeline to search functional motifs within whole-proteome data: a case study of poxviruses
- PMID: 28000080
- PMCID: PMC5357487
- DOI: 10.1007/s11262-016-1416-9
A bioinformatics pipeline to search functional motifs within whole-proteome data: a case study of poxviruses
Abstract
Proteins harbor domains or short linear motifs, which facilitate their functions and interactions. Finding functional motifs in protein sequences could predict the putative cellular roles or characteristics of hypothetical proteins. In this study, we present Shetti-Motif, which is an interactive tool to (i) map UniProt and PROSITE flat files, (ii) search for multiple pre-defined consensus patterns or experimentally validated functional motifs in large datasets protein sequences (proteome-wide), (iii) search for motifs containing repeated residues (low-complexity regions, e.g., Leu-, SR-, PEST-rich motifs, etc.). As proof of principle, using this comparative proteomics pipeline, eleven proteomes encoded by member of Poxviridae family were searched against about 100 experimentally validated functional motifs. The closely related viruses and viruses infect the same host cells (e.g. vaccinia and variola viruses) show similar motif-containing proteins profile. The motifs encoded by these viruses are correlated, which explains why poxviruses are able to interact with wide range of host cells. In conclusion, this in silico analysis is useful to establish a dataset(s) or potential proteins for further investigation or compare between species.
Keywords: Comparative genomics; Functional genomics; Low-complexity regions (LCRs); Protein annotation; Protein domain; Protein function.
Conflict of interest statement
Conflicts of interest
The author declares no conflict of interest.
Ethical approval
This article does not contain any studies with human participants or animals performed by any of the authors.
Figures


Similar articles
-
C-terminal motif prediction in eukaryotic proteomes using comparative genomics and statistical over-representation across protein families.BMC Genomics. 2007 Jun 26;8:191. doi: 10.1186/1471-2164-8-191. BMC Genomics. 2007. PMID: 17594486 Free PMC article.
-
SLiMSearch: a framework for proteome-wide discovery and annotation of functional modules in intrinsically disordered regions.Nucleic Acids Res. 2017 Jul 3;45(W1):W464-W469. doi: 10.1093/nar/gkx238. Nucleic Acids Res. 2017. PMID: 28387819 Free PMC article.
-
Use of viral motif mimicry improves the proteome-wide discovery of human linear motifs.Cell Rep. 2022 May 3;39(5):110764. doi: 10.1016/j.celrep.2022.110764. Cell Rep. 2022. PMID: 35508127
-
How pathogens use linear motifs to perturb host cell networks.Trends Biochem Sci. 2015 Jan;40(1):36-48. doi: 10.1016/j.tibs.2014.11.001. Epub 2014 Dec 1. Trends Biochem Sci. 2015. PMID: 25475989 Review.
-
Bioinformatics Approaches for Predicting Disordered Protein Motifs.Adv Exp Med Biol. 2015;870:291-318. doi: 10.1007/978-3-319-20164-1_9. Adv Exp Med Biol. 2015. PMID: 26387106 Review.
Cited by
-
A comparative review of viral entry and attachment during large and giant dsDNA virus infections.Arch Virol. 2017 Dec;162(12):3567-3585. doi: 10.1007/s00705-017-3497-8. Epub 2017 Sep 2. Arch Virol. 2017. PMID: 28866775 Free PMC article. Review.
-
Virophages and Their Interactions with Giant Viruses and Host Cells.Proteomes. 2018 May 22;6(2):23. doi: 10.3390/proteomes6020023. Proteomes. 2018. PMID: 29786634 Free PMC article. Review.
-
Resources to Discover and Use Short Linear Motifs in Viral Proteins.Trends Biotechnol. 2020 Jan;38(1):113-127. doi: 10.1016/j.tibtech.2019.07.004. Epub 2019 Aug 16. Trends Biotechnol. 2020. PMID: 31427097 Free PMC article. Review.
References
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials