Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Apr;53(2):173-178.
doi: 10.1007/s11262-016-1416-9. Epub 2016 Dec 20.

A bioinformatics pipeline to search functional motifs within whole-proteome data: a case study of poxviruses

Affiliations

A bioinformatics pipeline to search functional motifs within whole-proteome data: a case study of poxviruses

Haitham Sobhy. Virus Genes. 2017 Apr.

Abstract

Proteins harbor domains or short linear motifs, which facilitate their functions and interactions. Finding functional motifs in protein sequences could predict the putative cellular roles or characteristics of hypothetical proteins. In this study, we present Shetti-Motif, which is an interactive tool to (i) map UniProt and PROSITE flat files, (ii) search for multiple pre-defined consensus patterns or experimentally validated functional motifs in large datasets protein sequences (proteome-wide), (iii) search for motifs containing repeated residues (low-complexity regions, e.g., Leu-, SR-, PEST-rich motifs, etc.). As proof of principle, using this comparative proteomics pipeline, eleven proteomes encoded by member of Poxviridae family were searched against about 100 experimentally validated functional motifs. The closely related viruses and viruses infect the same host cells (e.g. vaccinia and variola viruses) show similar motif-containing proteins profile. The motifs encoded by these viruses are correlated, which explains why poxviruses are able to interact with wide range of host cells. In conclusion, this in silico analysis is useful to establish a dataset(s) or potential proteins for further investigation or compare between species.

Keywords: Comparative genomics; Functional genomics; Low-complexity regions (LCRs); Protein annotation; Protein domain; Protein function.

PubMed Disclaimer

Conflict of interest statement

Conflicts of interest

The author declares no conflict of interest.

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Figures

Fig. 1
Fig. 1
Screenshot of Shetti-Motif main window (a), and flowchart of features and method used in this study (b)
Fig. 2
Fig. 2
Poxviruses encode divergent number of motifs; the motif-containing proteins (McPs) profile of closely related viruses are correlated. The number of motif-containing proteins (i.e., protein containing at least one instance of the query motif) were counted and normalized (percentage) to total number of proteins encoded by a virus, Table 1, S3. a Box and whisker plot shows 1st, 2nd, 3rd quantiles (Q1, Q2 and Q3 respectively) of numbers of McPs, whiskers at 1.5 IQR (interquartile range) (Q3 + 1.5 IQR); b the average and maximum numbers of McPs, the error bars are based on standard deviation values; c Spearman correlation coefficient values and scatterplots (using STASTICA Data Miner; StatSoft, USA) of the number of McPs encoded by each virus. VacV: Vaccinia virus WR, VarV: Variola virus, MPxV: Monkeypox virus, YMTV: Yaba monkey tumor virus, FPxV: Fowlpox virus, CPxV: Canarypox virus, Orf V: Orf virus, CPxV: Cowpox virus, CmPxV: Camelpox virus M-96, MV: Myxoma virus, and NileCV: Nile crocodilepox virus

Similar articles

Cited by

References

    1. Sobhy H. Proteomes. 2016;4:3. doi: 10.3390/proteomes4010003. - DOI - PMC - PubMed
    1. Tompa P, Davey NE, Gibson TJ, Babu MM. Mol. Cell. 2014;55:161–169. doi: 10.1016/j.molcel.2014.05.032. - DOI - PubMed
    1. Van Roey K, Uyar B, Weatheritt RJ, Dinkel H, Seiler M, Budd A, Gibson TJ, Davey NE. Chem. Rev. 2014;114:6733–6778. doi: 10.1021/cr400585q. - DOI - PubMed
    1. Kadaveru K, Vyas J, Schiller MR. Front Biosci. 2008;13:6455–6471. doi: 10.2741/3166. - DOI - PMC - PubMed
    1. Via A, Uyar B, Brun C, Zanzoni A. Trends Biochem. Sci. 2015;40:36–48. doi: 10.1016/j.tibs.2014.11.001. - DOI - PubMed

MeSH terms