Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 May 9:2018:1391265.
doi: 10.1155/2018/1391265. eCollection 2018.

Predicting Interactions between Virus and Host Proteins Using Repeat Patterns and Composition of Amino Acids

Affiliations

Predicting Interactions between Virus and Host Proteins Using Repeat Patterns and Composition of Amino Acids

Saud Alguwaizani et al. J Healthc Eng. .

Abstract

Previous methods for predicting protein-protein interactions (PPIs) were mainly focused on PPIs within a single species, but PPIs across different species have recently emerged as an important issue in some areas such as viral infection. The primary focus of this study is to predict PPIs between virus and its targeted host, which are involved in viral infection. We developed a general method that predicts interactions between virus and host proteins using the repeat patterns and composition of amino acids. In independent testing of the method with PPIs of new viruses and hosts, it showed a high performance comparable to the best performance of other methods for single virus-host PPIs. In comparison of our method with others using same datasets, our method outperformed the others. The repeat patterns and composition of amino acids are simple, yet powerful features for predicting virus-host PPIs. The method developed in this study will help in finding new virus-host PPIs for which little information is available.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Example of computing feature 1 (F1) of amino acid repeats. F1 is the sum of squared length of single amino acid repeats (SARs) in the whole protein sequence. In this example, F1 (repeats of S) = 12 + 32 + 32 = 19, F1 (repeats of W) = 42 + 22 = 20, and F1 (repeats of R) = 12 + 62 = 37.
Figure 2
Figure 2
Example of computing feature 2 (F2) of amino acid repeats. F2 is the maximum value of the sum of squared length of single amino acid repeats in a window of size six. The maximum repeat size of amino acid S is 3, which is observed in the windows starting at 4, 5, 6, 7, 13, 14, and 15. So, F2 (repeats of S) = 32 = 9. The maximum repeat size of amino acid W is 4, observed in the windows starting at 1 and 2. F2 (repeats of W) = 42 = 16. The maximum repeat size of amino acid R is 6, observed in the window starting at 10. F2 (repeats of R) = 62 = 36.
Figure 3
Figure 3
Values of feature 2 for windows of six and seven residues. With a window of size 6, different patterns of single amino acid repeats lead to 15 different values for feature 2. With a window of size 7, different patterns of single amino acid repeats can lead to a same value for feature 2, as shown in this example.
Figure 4
Figure 4
ROC curves of independent testing of the SVM model on PPIs of new viruses with human. The SVM model showed the largest area under the ROC curve (AUC) of 0.970 for the PPIs of the SARS virus with human.
Figure 5
Figure 5
ROC curves of independent testing of the SVM model on PPIs of new hosts with viruses. The SVM model showed the largest area under the ROC curve (AUC) of 0.959 for the PPIs of E. coli K-12 with viruses.

References

    1. de La Vega M.-A., Stein D., Kobinger G. P. Ebolavirus evolution. Past and present. 2015;11(11):1–10. doi: 10.1371/journal.ppat.1005221. - DOI - PMC - PubMed
    1. Gatherer D. The 2014 Ebola virus disease outbreak in West Africa. 2014;95(8):1619–1624. doi: 10.1099/vir.0.067199-0. - DOI - PubMed
    1. Zhou H., Gao S., Ninh Nguyen N., et al. Stringent homology-based prediction of H. sapiens-M. tuberculosis H37Rv protein-protein interactions. 2014;9(1):p. 5. doi: 10.1186/1745-6150-9-5. - DOI - PMC - PubMed
    1. Cui G., Fang C., Han K. Prediction of protein-protein interactions between viruses and human by an SVM model. 2012;13(7):p. S5. doi: 10.1186/1471-2105-13-s7-s5. - DOI - PMC - PubMed
    1. Kim B., Alguwaizani S., Zhou X., Huang D.-S., Park B., Han K. An improved method for predicting interactions between virus and human proteins. 2017;15(1):p. 1650024. doi: 10.1142/s0219720016500244. - DOI - PubMed

Publication types

LinkOut - more resources