Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Nov 22:12:784863.
doi: 10.3389/fgene.2021.784863. eCollection 2021.

Prediction of Protein-Protein Interaction Sites Based on Stratified Attentional Mechanisms

Affiliations

Prediction of Protein-Protein Interaction Sites Based on Stratified Attentional Mechanisms

Minli Tang et al. Front Genet. .

Abstract

Proteins are the basic substances that undertake human life activities, and they often perform their biological functions through interactions with other biological macromolecules, such as cell transmission and signal transduction. Predicting the interaction sites between proteins can deepen the understanding of the principle of protein interactions, but traditional experimental methods are time-consuming and labor-intensive. In this study, a new hierarchical attention network structure, named HANPPIS, by adding six effective features of protein sequence, position-specific scoring matrix (PSSM), secondary structure, pre-training vector, hydrophilic, and amino acid position, is proposed to predict protein-protein interaction (PPI) sites. The experiment proved that our model has obtained very effective results, which was better than the existing advanced calculation methods. More importantly, we used the double-layer attention mechanism to improve the interpretability of the model and to a certain extent solved the problem of the "black box" of deep neural networks, which can be used as a reference for location positioning on the biological level.

Keywords: deep learning; feature fusion; multilevel attention mechanism; protein features; protein–protein interaction.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

FIGURE 1
FIGURE 1
Structure of HANPPIS. It consists of three steps, including embedded representation, amino acid–level attention and K-mers–level attention. We obtain vector representations of protein sequence fragments through multidimensional features. The vector representation of the protein fragment is the input to the first layer of the attention mechanism, and then the vector representation of the protein sequence is obtained through the second layer of attention and finally input to the prediction layer.
FIGURE 2
FIGURE 2
Amino acid feature generation and expression. This figure illustrates the specific details of the amino acid signature generation. Among them, because the pre-training vector feature dimension is too large, a layer of feedforward neural network is used to reduce the dimension to 50 dimensions. Then, the remaining five features are spliced and finally the 101-dimensional amino acid feature vector as the input of the entire model is obtained.
FIGURE 3
FIGURE 3
Attention distribution. This figure shows the attention visualization result of one of the samples (from the 135th amino acid of protein 1Z0J_A in Dset186, the sample sequence is “RDAKDYA” and the target amino acid is “K”). (A) shows the proportion of K-mers–level attention distribution and (B) shows the distribution of amino acid–level attention. As shown in the figure, the center position has the highest proportion of attention, which is also consistent with the task of protein interaction sites.

Similar articles

Cited by

References

    1. Bradford J. R., Westhead D. R. (2005). Improved Prediction of Protein-Protein Binding Sites Using a Support Vector Machines Approach. Bioinformatics 21, 1487–1494. 10.1093/bioinformatics/bti242 - DOI - PubMed
    1. Chung J.-L., Wang W., Bourne P. E. (2007). High-Throughput Identification of Interacting Protein-Protein Binding Sites. BMC Bioinformatics 8, 223–235. 10.1186/1471-2105-8-223 - DOI - PMC - PubMed
    1. Dhole K., Singh G., Pai P., Mondal S. (2015). SPRINGS: Prediction of Protein-Protein Interaction Sites Using Artificial Neural Networks [J]. Peerj 1, 1–15. 10.7287/PEERJ.PREPRINTS.266V2 - DOI
    1. Dhole K., Singh G., Pai P. P., Mondal S. (2014). Sequence-Based Prediction of Protein-Protein Interaction Sites with L1-Logreg Classifier. J. Theor. Biol. 348, 47–54. 10.1016/j.jtbi.2014.01.028 - DOI - PubMed
    1. Gainza P., Sverrisson F., Monti F., Rodolà E., Boscaini D., Bronstein M. M., et al. (2020). Deciphering Interaction Fingerprints from Protein Molecular Surfaces Using Geometric Deep Learning. Nat. Methods 17 (2), 184–192. 10.1038/s41592-019-0666-6 - DOI - PubMed

LinkOut - more resources