Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jul 29;5(1):vbaf180.
doi: 10.1093/bioadv/vbaf180. eCollection 2025.

Estimating protein complex model accuracy using graph transformers and pairwise similarity graphs

Affiliations

Estimating protein complex model accuracy using graph transformers and pairwise similarity graphs

Jian Liu et al. Bioinform Adv. .

Abstract

Motivation: Estimation of protein complex structure accuracy is essential for effective structural model selection in structural biology applications such as protein function analysis and drug design. Despite the success of structure prediction methods such as AlphaFold2 and AlphaFold3, selecting top-quality structural models from large model pools remains challenging.

Results: We present GATE, a novel method that uses graph transformers on pairwise model similarity graphs to predict the quality (accuracy) of complex structural models. By integrating single-model and multimodel quality features, GATE captures intrinsic model characteristics and intermodel geometric similarities to make robust predictions. On the dataset of the 15th Critical Assessment of Protein Structure Prediction (CASP15), GATE achieved the highest Pearson's correlation (0.748) and the lowest ranking loss (0.1191) compared with existing methods. In the blind CASP16 experiment, GATE ranked fifth based on the sum of z-scores, with a Pearson's correlation of 0.7076 (first), a Spearman's correlation of 0.4514 (fourth), a ranking loss of 0.1221 (third), and an area under the curve score of 0.6680 (third) on per-target TM-score-based metrics. Additionally, GATE also performed consistently on large in-house datasets generated by extensive AlphaFold-based sampling with MULTICOM4, confirming its robustness and practical applicability in real-world model selection scenarios.

Availability and implementation: GATE is available at https://github.com/BioinfoMachineLearning/GATE.

PubMed Disclaimer

Conflict of interest statement

None declared.

Figures

Figure 1.
Figure 1.
The workflow of GATE for predicting protein complex structure quality. The input consists of a set of protein complex structures (decoys) predicted from a protein sequence. (A) A pairwise similarity graph is constructed, where nodes represent individual decoys, and edges connect two structurally similar decoys. (B) Subgraphs are sampled based on structural similarity to make sure each group of similar models is equally represented in the subgraphs, preventing large groups from dominating small groups. (C) The sampled subgraphs are processed by a graph transformer to predict the quality score for each decoy. (D) Predicted quality scores from all sampled subgraphs are aggregated to produce the final quality score for each decoy.

Update of

Similar articles

Cited by

References

    1. Abramson J, Adler J, Dunger J et al. Accurate structure prediction of biomolecular interactions with alphafold 3. Nature 2024;630:493–500. - PMC - PubMed
    1. Alford RF, Leaver-Fay A, Jeliazkov JR et al. The rosetta all-atom energy function for macromolecular modeling and design. J Chem Theory Comput 2017;13:3031–48. - PMC - PubMed
    1. Basu S, Wallner B. Dockq: a quality measure for protein-protein docking models. PLoS One 2016;11:e0161879. - PMC - PubMed
    1. Cao R, Adhikari B, Bhattacharya D et al. Qacon: single model quality assessment using protein structural and contact information with machine learning techniques. Bioinformatics 2017;33:586–8. - PMC - PubMed
    1. Cao R, Bhattacharya D, Adhikari B et al. Large-scale model quality assessment for improving protein tertiary structure prediction. Bioinformatics 2015;31:i116–23. - PMC - PubMed

LinkOut - more resources