Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jun 30;39(39 Suppl 1):i308-i317.
doi: 10.1093/bioinformatics/btad203.

A gated graph transformer for protein complex structure quality assessment and its performance in CASP15

Affiliations

A gated graph transformer for protein complex structure quality assessment and its performance in CASP15

Xiao Chen et al. Bioinformatics. .

Abstract

Motivation: Proteins interact to form complexes to carry out essential biological functions. Computational methods such as AlphaFold-multimer have been developed to predict the quaternary structures of protein complexes. An important yet largely unsolved challenge in protein complex structure prediction is to accurately estimate the quality of predicted protein complex structures without any knowledge of the corresponding native structures. Such estimations can then be used to select high-quality predicted complex structures to facilitate biomedical research such as protein function analysis and drug discovery.

Results: In this work, we introduce a new gated neighborhood-modulating graph transformer to predict the quality of 3D protein complex structures. It incorporates node and edge gates within a graph transformer framework to control information flow during graph message passing. We trained, evaluated and tested the method (called DProQA) on newly-curated protein complex datasets before the 15th Critical Assessment of Techniques for Protein Structure Prediction (CASP15) and then blindly tested it in the 2022 CASP15 experiment. The method was ranked 3rd among the single-model quality assessment methods in CASP15 in terms of the ranking loss of TM-score on 36 complex targets. The rigorous internal and external experiments demonstrate that DProQA is effective in ranking protein complex structures.

Availability and implementation: The source code, data, and pre-trained models are available at https://github.com/jianlin-cheng/DProQA.

PubMed Disclaimer

Conflict of interest statement

None declared.

Figures

Figure 1.
Figure 1.
An overview of the DProQA pipeline for quality assessment of protein complexes. The input is a protein complex structure. The output includes the predicted quality score of the input structure (e.g. DockQ score) and the probability of the quality class (i.e. incorrect topology, acceptable quality, medium quality, and high quality) which the structure is classified into.
Figure 2.
Figure 2.
The gated graph transformer (GGT) model architecture for updating the features of nodes of a protein complex graph.
Figure 3.
Figure 3.
The average TM-score ranking loss for all single-model methods. MULTICOM_egnn ranked 3rd among all single-model methods.
Figure 4.
Figure 4.
MULTICOM_egnn’s loss histogram for CASP15 targets. The black dashed vertical line represents the position of the mean value.
Figure 5.
Figure 5.
Target H1111. (a) TM-score distribution of CASP15 models of H1111. (b). From left to right, the three protein complex structures shown are the corresponding native structure, the true TOP-1 model, and the MULTICOM_egnn top selected model, respectively. Here, MULTICOM_egnn achieved a 0.0014 TM-score ranking loss.

References

    1. Athanasios A, Charalampos V, Vasileios T. et al. Protein–protein interaction (PPI) network: recent advances in drug discovery. Curr Drug Metab 2017;18:5–10. - PubMed
    1. Baker D. Prediction and design of macromolecular structures and interactions. Philos Trans R Soc Lond B Biol Sci 2006;361:459–63. - PMC - PubMed
    1. Basu S, Wallner B.. Dockq: a quality measure for protein–protein docking models. PLoS ONE 2016a;11:e0161879. - PMC - PubMed
    1. Basu S, Wallner B.. Finding correct protein–protein docking models using proqdock. Bioinformatics 2016b;32:i262–i270. - PMC - PubMed
    1. Bryant P, Pozzati G, Elofsson A. et al. Improved prediction of protein–protein interactions using alphafold2. Nat Commun 2022;13:1–11. - PMC - PubMed

Publication types