Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Feb;35(2):131-166.
doi: 10.1007/s10822-020-00362-6. Epub 2021 Jan 4.

Overview of the SAMPL6 pKa challenge: evaluating small molecule microscopic and macroscopic pKa predictions

Affiliations

Overview of the SAMPL6 pKa challenge: evaluating small molecule microscopic and macroscopic pKa predictions

Mehtap Işık et al. J Comput Aided Mol Des. 2021 Feb.

Abstract

The prediction of acid dissociation constants (pKa) is a prerequisite for predicting many other properties of a small molecule, such as its protein-ligand binding affinity, distribution coefficient (log D), membrane permeability, and solubility. The prediction of each of these properties requires knowledge of the relevant protonation states and solution free energy penalties of each state. The SAMPL6 pKa Challenge was the first time that a separate challenge was conducted for evaluating pKa predictions as part of the Statistical Assessment of Modeling of Proteins and Ligands (SAMPL) exercises. This challenge was motivated by significant inaccuracies observed in prior physical property prediction challenges, such as the SAMPL5 log D Challenge, caused by protonation state and pKa prediction issues. The goal of the pKa challenge was to assess the performance of contemporary pKa prediction methods for drug-like molecules. The challenge set was composed of 24 small molecules that resembled fragments of kinase inhibitors, a number of which were multiprotic. Eleven research groups contributed blind predictions for a total of 37 pKa distinct prediction methods. In addition to blinded submissions, four widely used pKa prediction methods were included in the analysis as reference methods. Collecting both microscopic and macroscopic pKa predictions allowed in-depth evaluation of pKa prediction performance. This article highlights deficiencies of typical pKa prediction evaluation approaches when the distinction between microscopic and macroscopic pKas is ignored; in particular, we suggest more stringent evaluation criteria for microscopic and macroscopic pKa predictions guided by the available experimental data. Top-performing submissions for macroscopic pKa predictions achieved RMSE of 0.7-1.0 pKa units and included both quantum chemical and empirical approaches, where the total number of extra or missing macroscopic pKas predicted by these submissions were fewer than 8 for 24 molecules. A large number of submissions had RMSE spanning 1-3 pKa units. Molecules with sulfur-containing heterocycles or iodo and bromo groups were less accurately predicted on average considering all methods evaluated. For a subset of molecules, we utilized experimentally-determined microstates based on NMR to evaluate the dominant tautomer predictions for each macroscopic state. Prediction of dominant tautomers was a major source of error for microscopic pKa predictions, especially errors in charged tautomers. The degree of inaccuracy in pKa predictions observed in this challenge is detrimental to the protein-ligand binding affinity predictions due to errors in dominant protonation state predictions and the calculation of free energy corrections for multiple protonation states. Underestimation of ligand pKa by 1 unit can lead to errors in binding free energy errors up to 1.2 kcal/mol. The SAMPL6 pKa Challenge demonstrated the need for improving pKa prediction methods for drug-like molecules, especially for challenging moieties and multiprotic molecules.

Keywords: Acid dissociation constant; Blind prediction challenge; Macroscopic pK a; Macroscopic protonation state; Microscopic pK a; Microscopic protonation state; SAMPL; Small molecule; pK a.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.. Distribution of molecular properties of the 24 compounds from the SAMPL6 pKa Challenge.
A Histogram of spectrophotometric pKa measurements collected with Sirius T3 [8]. The overlaid rug plot indicates the actual values. Five compounds have multiple measured pKas in the range of 2–12. B Histogram of molecular weights calculated for the neutral state of the compounds in SAMPL6 set. Molecular weights were calculated by neglecting counterions. C Histogram of the number of non-terminal rotatable bonds in each molecule. D The histogram of the ratio of heteroatom (non-carbon heavy atoms including, O, N, F, S, Cl, Br, I) count to the number of carbon atoms.
Figure 2.
Figure 2.. RMSE and unmatched pKa counts vs. submission ID plots for macroscopic pKa predictions based on Hungarian matching.
Methods are indicated by submission IDs. RMSE is shown with error bars denoting 95% confidence intervals obtained by bootstrapping over challenge molecules. Submissions are colored by their method categories. Light blue colored database lookup methods are utilized as the null prediction method. QM methods category (navy) includes pure QM, QM+LEC, and QM+MM approaches. Lower bar plots show the number of unmatched experimental pKa values (light grey, missing predictions) and the number of unmatched pKa predictions (dark grey, extra predictions) for each method between pH 2 and 12. Submission IDs are summarized in Table 1. Submission IDs of the form nb### refer to non-blinded reference methods computed after the blind challenge submission deadline. All others refer to blind, prospective predictions.
Figure 3.
Figure 3.. Additional performance statistics for macroscopic pKa predictions based on Hungarian matching.
Methods are indicated by submission IDs. Mean absolute error (MAE), mean error (ME), Pearson’s R2, and Kendall’s Rank Correlation Coefficient Tau (τ) are shown, with error bars denoting 95% confidence intervals were obtained by bootstrapping over challenge molecules. Refer to Table 1 for the submission IDs and method names. Submissions are colored by their method categories. Light blue colored database lookup methods are utilized as the null prediction method.
Figure 4.
Figure 4.. Predicted vs. experimental macroscopic pKa prediction for four consistently well-performing methods, a representative method with average performance (2ii2g), and the null method (5nm4j).
When submissions were ranked according to RMSE, MAE, R2, and τ, four methods ranked in the Top 10 consistently in each of these metrics. Dark and light green shaded areas indicate 0.5 and 1.0 units of error. Error bars indicate standard error of the mean of predicted and experimental values. Experimental pKa SEM values are too small to be seen under the data points. EC-RISM/MP2/cc-pVTZ-P2-q-noThiols-2par method (2ii2g) was selected as the representative method with average performance because it is the method with the highest RMSE below the median.
Figure 5.
Figure 5.. Molecules from the SAMPL6 Challenge with MAE calculated for all macroscopic pKa predictions.
The MAE calculated over all prediction methods indicates which molecules had the lowest prediction accuracy in the SAMPL6 Challenge. MAE values calculated for each molecule include all the matched pKa values. SM06, SM14, SM15, SM16, SM18, and SM22 were multiprotic. Hungarian matching algorithm was employed for pairing experimental and predicted pKa values. MAE values are reported with 95% confidence intervals.
Figure 6.
Figure 6.. Average prediction accuracy calculated over all prediction methods was poorer for molecules with sulfur-containing heterocycles, bromo, and iodo groups.
(A) MAE calculated for each molecule as an average of all methods. (B) MAE of each molecule broken out by method category. QM-based methods (blue) include QM predictions with or without linear empirical correction. Empirical methods (green) include QSAR, ML, DL, and LFER approaches. (C) Depiction of SAMPL6 molecules with sulfur-containing heterocycles. (D) Depiction of SAMPL6 molecules with iodo and bromo groups.
Figure 7.
Figure 7.. Macroscopic pKa prediction error distribution plots show how prediction accuracy varies across methods and individual molecules.
(A) pKa prediction error distribution for each submission for all molecules according to Hungarian matching. (B) Error distribution for each SAMPL6 molecule for all prediction methods according to Hungarian matching. For multiprotic molecules, pKa ID numbers (pKa1, pKa2, and pKa3) were assigned in the direction of increasing experimental pKa value.
Figure 8.
Figure 8.. NMR determination of dominant microstates allowed in-depth evaluation of microscopic pKa predictions for 8 compounds.
A Dominant microstate sequence of two compounds (SM07 and SM14) were determined by NMR [8]. Based on these reference compounds, the dominant microstates of 6 related compounds were inferred and experimental pKa values were assigned to titratable groups with the assumption that only the dominant microstates have significant contributions to the experimentally observed pKa. B RMSE vs. submission ID and unmatched pKa vs. submission ID plots for the evaluation of microscopic pKa predictions of 8 molecules by Hungarian matching to experimental macroscopic pKa values. C RMSE vs. submission ID and unmatched pKa vs. submission ID plots showing the evaluation of microscopic pKa predictions of 8 molecules by microstate-based matching between predicted microscopic pKas and experimental macroscopic pKa values. Submissions 0wfzo, z3btx, 758j8, and hgn83 have RMSE values bigger than 10 pKa units which are beyond the y-axis limits of subplot C and B. RMSE is shown with error bars denoting 95% confidence intervals obtained by bootstrapping over the challenge molecules. Lower bar plots show the number of unmatched experimental pKas (light grey, missing predictions) and the number of unmatched pKa predictions (dark grey, extra predictions) for each method between pH 2 and 12. Submission IDs are summarized in Table 1.
Figure 9.
Figure 9.. Additional performance statistics for microscopic pKa predictions for 8 molecules with experimentally determined dominant microstates.
Microstate-based matching was performed between experimental pKa values and predicted microscopic pKa values. Mean absolute error (MAE), mean error (ME), Pearson’s R2, and Kendall’s Rank Correlation Coefficient Tau (τ) are shown, with error bars denoting 95% confidence intervals obtained by bootstrapping over challenge molecules. Methods are indicated by their submission IDs. Submissions are colored by their method categories. Refer to Table 1 for submission IDs and method names. Submissions 0wfzo, z3btx, 758j8, and hgn83 have MAE and ME values bigger than 10 pKa units which are beyond the y-axis limits of subplots A and B. A large number and wide variety of methods have statistically indistinguishable performance based on correlation statistics (C and D), in part because of the relatively small dynamic range and small size of the set of 8 molecules.
Figure 10.
Figure 10.. Some methods predicted the sequence of dominant tautomers inaccurately.
Prediction accuracy of the dominant microstate of each charged state was calculated using the dominant microstate sequence determined by NMR for 8 molecules as reference. (A) Dominant microstate accuracy vs. submission ID plot was calculated considering all the dominant microstates seen in the experimental microstate dataset of 8 molecules. (B) Dominant microstate accuracy vs. submission ID plot was generating considering only the dominant microstates of charge 0 and +1 seen in the 8 molecule dataset. The accuracy of each molecule is broken out by the total charge of the microstate. (C) Dominant microstate prediction accuracy calculated for each molecule averaged over all methods. In (B) and (C), the accuracy of predicting the dominant neutral tautomer is shown in blue and the accuracy of predicting the dominant +1 charged tautomer is shown in green. Error bars denoting 95% confidence intervals obtained by bootstrapping.
Figure 11.
Figure 11.. Aqueous ligand pKa can influence overall protein-ligand binding affinity.
A When only the minor aqueous protonation state contributes to protein-ligand complex formation, the overall binding free energy (ΔGbind) needs to be calculated as the sum of binding affinity of the minor state and the protonation penalty of that state. B When multiple charge states contribute to complex formation, the overall free energy of binding includes a multiple protonation states correction (MPSC) term (ΔGcorr). MPSC is a function of pH, aqueous pKa of the ligand, and the difference between the binding free energy of charged and neutral species (ΔGbindCΔGbindN).
Figure 12.
Figure 12.. Inaccuracy of pKa prediction (± 1 unit) affects the the accuracy of MPSC and overall protein-ligand binding free energy calculations to varying degrees based on aqueous pKa and relative binding affinity of individual protonation states ΔΔG=ΔGbindCΔGbindN).
All calculations are made for 25°C, and a ligand with a single basic titratable group. A, C, E, and G show MPSC (ΔGcorr) calculated with true vs. inaccurate pKa. B, D, F, and H show the comparison of the absolute error to ΔGbind caused by ignoring the MPSC completely (solid lines) vs. calculating MPSC based in inaccurate pKa value (dashed lines). These plots provide guidance on when it is beneficial to include MPSC correction based on pKa error, pH - pKa, and ΔΔG.

References

    1. Manallack DT, Prankerd RJ, Yuriev E, Oprea TI, Chalmers DK. The Significance of Acid/Base Properties in Drug Discovery. Chem Soc Rev. 2013; 42(2):485–496. doi: 10.1039/C2CS35348B. - DOI - PMC - PubMed
    1. Charifson PS, Walters WP. Acidic and Basic Drugs in Medicinal Chemistry: A Perspective. Journal of Medicinal Chemistry. 2014. December; 57(23):9701–9717. doi: 10.1021/jm501000a. - DOI - PubMed
    1. Manallack DT, Prankerd RJ, Nassta GC, Ursu O, Oprea TI, Chalmers DK. A Chemogenomic Analysis of Ionization Constants-Implications for Drug Discovery. ChemMedChem. 2013. February; 8(2):242–255. doi: 10.1002/cmdc.201200507. - DOI - PMC - PubMed
    1. de Oliveira C, Yu HS, Chen W, Abel R, Wang L. Rigorous Free Energy Perturbation Approach to Estimating Relative Binding Affinities between Ligands with Multiple Protonation and Tautomeric States. Journal of Chemical Theory and Computation. 2019. January; 15(1):424–435. doi: 10.1021/acs.jctc.8b00826. - DOI - PubMed
    1. Darvey IG. The Assignment of pKa Values to Functional Groups in Amino Acids. Biochemical Education. 1995. April; 23(2):80–82. doi: 10.1016/0307-4412(94)00150-N. - DOI

Publication types