Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2025 Jun 20:rs.3.rs-6845168.
doi: 10.21203/rs.3.rs-6845168/v1.

Boosting AlphaFold Protein Tertiary Structure Prediction through MSA Engineering and Extensive Model Sampling and Ranking in CASP16

Affiliations

Boosting AlphaFold Protein Tertiary Structure Prediction through MSA Engineering and Extensive Model Sampling and Ranking in CASP16

Jian Liu et al. Res Sq. .

Abstract

AlphaFold2 and AlphaFold3 have revolutionized protein structure prediction by enabling high-accuracy tertiary structure predictions for most single-chain proteins (monomers). However, obtaining high-quality predictions for some hard protein targets with shallow or noisy multiple sequence alignments (MSAs) and complicated multi-domain architectures remains challenging. Here, we present MULTICOM4, an integrative protein structure prediction system that uses diverse MSA generation, large-scale model sampling, and an ensemble model quality assessment (QA) strategy of combining individual QA methods to improve model generation and ranking of AlphaFold2 and AlphaFold3. In the 16th Critical Assessment of Techniques for Protein Structure Prediction (CASP16), our predictors built on MULTICOM4 ranked among the top performers out of 120 predictors in tertiary structure prediction and outperformed a standard AlphaFold3 predictor. The average TM-score of our best performing predictor MULTCOM's top-1 prediction for 84 CASP16 domain is 0.902. It achieved high accuracy (TM-score > 0.9) for 73.8% of the 84 domains and correct fold predictions (TM-score > 0.5) for 97.6% domains in terms of top-1 prediction. In terms of best-of-top-5 prediction, it predicted correct folds for all the domains. The results show that MSA engineering through the use of different protein sequence databases, alignment tools, and domain segmentation as well as extensive model sampling are the key to generate accurate and correct structural models. Additionally, using multiple complementary QA methods and model clustering can improve the robustness and reliability of model ranking.

Keywords: AlphaFold; deep learning; protein model quality assessment; protein structure prediction.

PubMed Disclaimer

Conflict of interest statement

Competing interests The authors declare no competing interests.

Figures

Fig. 1:
Fig. 1:
The overall performance of top 20 out of 120 CASP16 predictors and the detailed performance of MULTICOM across 75 unique domains. (A) The cumulative Z-scores of top 20 predictors. The standard AlphaFold3 predictor (AF3-server) run by Elofsson’s group ranked 29th with a cumulative Z-score 25.71 (not shown); (B) The Z-score of the top-1 prediction of MULTICOM for each of the 75 domains.
Fig. 2:
Fig. 2:
The structural comparison between the native structures (green) and the top-1 models submitted by MULTICOM (blue) for five domains and the GDT-TS scores of the models. (a) T1267s1-D1, (b) T1237-D1, (c) T1210-D1, (d) T1207-D1, and (e) T1257-D1.
Fig. 3:
Fig. 3:
The TM-scores of the top-1 and best-of-top-5 models submitted by MULTICOM for the 84 domains. (A) The TM-scores of the top-1 models for the 84 domains. (B) TM-scores of top-1 models vs TM-scores of best-of-top-5 models on the 84 domains.
Fig. 4:
Fig. 4:
Head-to-head TM-score comparison between MULTICOM and AF3-server on 84 protein domains: (A) top-1 models and (B) best-of-top-5 models.
Fig. 5:
Fig. 5:
Comparison of GDT-TS performance between the in-house AlphaFold2, the in-house AlphaFold3 predictor, and CASP16 AF3-server for the 12 single-chain monomer targets. The plots show the top-1 and best-of-top-5 models generated by our in-house AlphaFold2, our in-house AlphaFold3 predictors, and AF3-server.
Fig. 6:
Fig. 6:
Comparison of predicted structures for target T1226-D1. (a) Native structure; (b) best model among the top-5 predictions of the in-house AlphaFold3 (GDT-TS = 0.650); (c) top-1 model of the in-house AlphaFold3 (GDT-TS = 0.336); (d) top-1 model of the in-house AlphaFold2 (GDT-TS = 0.334). The best-of-top-5 model of the in-house AlphaFold3 correctly predicted the folding of the C-terminal helices and their long-range interaction with other regions, while the other models did not.
Fig. 7:
Fig. 7:
Comparison of GDT-TS performance for top-1 models generated using different MSA sources. Each MSA was evaluated using AlphaFold2 under identical input settings.
Fig. 8:
Fig. 8:
Comparison of predicted structures for target T1266-D1 using full-length MSA and domain-based MSAs. (a) Native structure; (b) Full-length MSAs, including ColabFold, DeepMSA_qMSA, DeepMSA_dMSA, Default_AF2, ESM-MSA and DHR; (c) Domain-based MSAs with different domain segmentation methods: DomainParser, HHsearch, UniDoc and Manual.
Fig. 9:
Fig. 9:
GDT-TS scores of the top-1 models selected by different QA methods across 12 single-chain monomer targets. The five QA methods include AlphaFold2 plDDT, pairwise similarity score (PSS), GATE, EnQA, and GCPNet-EMA. MULTICOM represents the result of the final top-1 models submitted by the MULTICOM predictor, which integrated the multiple QA methods. Best (the virtual line) represents the upper bound (highest) of the GDT-TS in the model pool for each target.
Fig. 10:
Fig. 10:
Overview of the tertiary structure prediction module of MULTICOM4. The module begins with multiple sequence alignment (MSA) sampling on both general protein sequence databases (e.g., UniRef30, UniRef90) and specialized metagenomic sources (e.g., BFD, MGnify) using different alignment tools. For multi-domain targets, in addition to generating full-length MSAs, domain-based MSA construction is performed by segmenting the target sequence into domains, generating individual domain MSAs, and combining them into full-length MSAs through alignment pairing and gap padding. Templates are identified by aligning sequence profiles against the PDB70 and PDB sort90 databases. Structure prediction is carried out using multiple deep learning–based predictors, including a customized AlphaFold2 pipeline with various MSAs and templates as input (mainly), AlphaFold3 web server (extensive sampling, mainly), and ESMFold (for single-protein sequence modeling only). Predicted models are evaluated by multiple model quality assessment (QA) methods/metrics (e.g., GATE, PSS, and plDDT) to select high-confidence final predictions.

Similar articles

References

    1. Eickholt J. & Cheng J. Predicting protein residue–residue contacts using deep networks and boosting. Bioinformatics 28, 3066–3072 (2012). - PMC - PubMed
    1. Hou J., Adhikari B. & Cheng J. Deepsf: deep convolutional neural network for mapping protein sequences to folds. Bioinformatics 34, 1295–1303 (2018). - PMC - PubMed
    1. Adhikari B., Hou J. & Cheng J. Dncon2: improved protein contact prediction using two-level deep convolutional neural networks. Bioinformatics 34, 1466–1472 (2018). - PMC - PubMed
    1. Xu J. Distance-based protein folding powered by deep learning. Proceedings of the National Academy of Sciences 116, 16856–16865 (2019). - PMC - PubMed
    1. Senior A. W. et al. Improved protein structure prediction using potentials from deep learning. Nature 577, 706–710 (2020). - PubMed

Publication types

LinkOut - more resources