Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Feb 12;64(3):960-973.
doi: 10.1021/acs.jcim.3c01761. Epub 2024 Jan 22.

Conservation of Hot Spots and Ligand Binding Sites in Protein Models by AlphaFold2

Affiliations

Conservation of Hot Spots and Ligand Binding Sites in Protein Models by AlphaFold2

Ayse A Bekar-Cesaretli et al. J Chem Inf Model. .

Abstract

The neural network-based program AlphaFold2 (AF2) provides high accuracy structure prediction for a large fraction of globular proteins. An important question is whether these models are accurate enough for reliably docking small ligands. Several recent papers and the results of CASP15 reveal that local conformational errors reduce the success rates of direct ligand docking. Here, we focus on the ability of the models to conserve the location of binding hot spots, regions on the protein surface that significantly contribute to the binding free energy of the protein-ligand interaction. Clusters of hot spots predict the location and even the druggability of binding sites, and hence are important for computational drug discovery. The hot spots are determined by protein mapping that is based on the distribution of small fragment-sized probes on the protein surface and is less sensitive to local conformation than docking. Mapping models taken from the AlphaFold Protein Structure Database show that identifying binding sites is more reliable than docking, but the success rates are still 5% to 10% lower than based on mapping X-ray structures. The drop in accuracy is particularly large for models of multidomain proteins. However, both the model binding sites and the mapping results can be substantially improved by generating AF2 models for the ligand binding domains of interest rather than the entire proteins and even more if using forced sampling with multiple initial seeds. The mapping of such models tends to reach the accuracy of results obtained by mapping the X-ray structures.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
a) Binding sites in surface representation for Entry 20 of the AlphaFold model P28720 (green), bound structure 1S39 (cyan), and unbound structure 4Q8M (red), with the ligand AQO inside each site, shown in sticks representation. b) Binding sites in surface representation for Entry 12 of the AlphaFold model P08709 (green), bound structure 5PAW (cyan), and unbound structure 1JBU (red), with the ligand 7XM inside each site, shown in sticks representation. c) Binding sites in surface representation for Entry 21 of the AlphaFold model P00734 (green), bound structure 3P70 (cyan), and unbound structure 2UUF (red), with the ligand BEN inside each site, shown in sticks representation.
Figure 2.
Figure 2.
a) Complete AlphaFold model provided by the AlphaFold database for the protein with UniProt ID P00734. Contains multiple domains denoted by specific colors: orange - Gla domain, magenta - Kringle 1 domain, green - Kringle 2 domain, red - peptidase S1 domain, yellow - high-affinity receptor binding region. b) Binding sites of the X-ray structure 3P70_H (cyan), of the AF2 model downloaded from the AF database with UniProt ID P00734 (green), and of the AF2 model generated using only the sequence of the ligand-binding domain (yellow). Representative probes in the strongest hot spots predicted by FTMap are shown as wires in yellow, brown, and orange, respectively, for each model.
Figure 3.
Figure 3.
a) All-atom RMSDs for the global alignment of the AF model P00734 from the AlphaFold database and the X-ray structures of 3P70_H with 90% sequence identity. 3P70_H is the PDB ID of the reference, ligand-bound X-ray structure and 2UUF_B is the unbound X-ray structure. RMSDs of both structures are depicted in red arrows on the graph. b) All-atom RMSDs for the global alignment of the AF2 model generated for the ligand binding domain of P00734 and the X-ray structures of 3P70_H with 90% sequence identity. 3P70_H is the PDB ID of the reference, ligand-bound X-ray structure and 2UUF_B is the unbound X-ray structure. RMSDs of both structures are depicted in red arrows on the graph. c) Density plots for the all-atom RMSDs of the global alignment of the AF models P00734 with X-ray structures of 3P70_H with 90% sequence identity.
Figure 4.
Figure 4.
a) a) All-atom RMSDs from the local alignment of the AF model P00734 from the AlphaFold database and the X-ray structures of 3P70_H with 90% sequence identity. 3P70_H is the PDB ID of the reference, ligand-bound X-ray structure and 2UUF_B is the unbound X-ray structure. RMSDs of both structures are depicted in red arrows on the graph. b) All-atom RMSDs from the local alignment of the AF2 model generated for the ligand binding domain of P00734 and the X-ray structures of 3P70_H with 90% sequence identity. 3P70_H is the PDB ID of the reference, ligand-bound X-ray structure and 2UUF_B is the unbound X-ray structure. RMSDs of both structures are depicted in red arrows on the graph. c) Density plots for the all-atom RMSDs of the local alignment of the AF models of the P00734 with X-ray structures of 3P70_H with 90% sequence identity.

References

    1. Jumper J; Evans R; Pritzel A; Green T; Figurnov M; Ronneberger O; Tunyasuvunakool K; Bates R; Zidek A; Potapenko A; Bridgland A; Meyer C; Kohl SAA; Ballard AJ; Cowie A; Romera-Paredes B; Nikolov S; Jain R; Adler J; Back T; Petersen S; Reiman D; Clancy E; Zielinski M; Steinegger M; Pacholska M; Berghammer T; Bodenstein S; Silver D; Vinyals O; Senior AW; Kavukcuoglu K; Kohli P; Hassabis D, Highly Accurate Protein Structure Prediction with Alphafold. Nature 2021, 596, 583–589. - PMC - PubMed
    1. Skolnick J; Gao M; Zhou H; Singh S, Alphafold 2: Why It Works and Its Implications for Understanding the Relationships of Protein Sequence, Structure, and Function. J Chem Inf Model 2021, 61, 4827–4831. - PMC - PubMed
    1. Thornton JM; Laskowski RA; Borkakoti N, Alphafold Heralds a Data-Driven Revolution in Biology and Medicine. Nat Med 2021, 27, 1666–1669. - PubMed
    1. Jumper J; Evans R; Pritzel A; Green T; Figurnov M; Ronneberger O; Tunyasuvunakool K; Bates R; Zidek A; Potapenko A; Bridgland A; Meyer C; Kohl SAA; Ballard AJ; Cowie A; Romera-Paredes B; Nikolov S; Jain R; Adler J; Back T; Petersen S; Reiman D; Clancy E; Zielinski M; Steinegger M; Pacholska M; Berghammer T; Silver D; Vinyals O; Senior AW; Kavukcuoglu K; Kohli P; Hassabis D, Applying and Improving Alphafold at Casp14. Proteins 2021, 89, 1711–1721. - PMC - PubMed
    1. Elofsson A, Progress at Protein Structure Prediction, as Seen in Casp15. Curr Opin Struct Biol 2023, 80, 102594. - PubMed

Publication types

LinkOut - more resources