Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Apr 25;8(3):180-206.
doi: 10.5599/admet.794. eCollection 2020.

Can small drugs predict the intrinsic aqueous solubility of 'beyond Rule of 5' big drugs?

Affiliations

Can small drugs predict the intrinsic aqueous solubility of 'beyond Rule of 5' big drugs?

Alex Avdeef et al. ADMET DMPK. .

Abstract

The aim of the study was to explore to what extent small molecules (mostly from the Rule of 5 chemical space) can be used to predict the intrinsic aqueous solubility, S0, of big molecules from beyond the Rule of 5 (bRo5) space. It was demonstrated that the General Solubility Equation (GSE) and the Abraham Solvation Equation (ABSOLV) underpredict solubility in systematic but slightly ways. The Random Forest regression (RFR) method predicts solubility more accurately, albeit in the manner of a 'black box.' It was discovered that the GSE improves considerably in the case of big molecules when the coefficient of the log P term (octanol-water partition coefficient) in the equation is set to -0.4 instead of the traditional -1 value. The traditional GSE underpredicts solubility for molecules with experimental S0 < 50 μM. In contrast, the ABSOLV equation (trained with small molecules) underpredicts the solubility of big molecules in all cases tested. It was found that the errors in the ABSOLV-predicted solubilities of big molecules correlate linearly with the number of rotatable bonds, which suggests that flexibility may be an important factor in differentiating solubility of small from big molecules. Notably, most of the 31 big molecules considered have negative enthalpy of solution: these big molecules become less soluble with increasing temperature, which is compatible with 'molecular chameleon' behavior associated with intramolecular hydrogen bonding. The X-ray structures of many of these molecules reveal void spaces in their crystal lattices large enough to accommodate many water molecules when such solids are in contact with aqueous media. The water sorbed into crystals suspended in aqueous solution may enhance solubility by way of intra-lattice solute-water interactions involving the numerous H-bond acceptors in the big molecules studied. A 'Solubility Enhancement-Big Molecules' index was defined, which embodies many of the above findings.

Keywords: Abraham Solvation Equation (ABSOLV); General Solubility Equation (GSE); Partial Least Squares (PLS); Random Forest regression (RFR); Rule of 5 (Ro5), beyond Ro5 (bRo5); Solubility Enhancement–Big Molecules (SEBM); aqueous intrinsic solubility; intramolecular hydrogen bonding (IMHB).

PubMed Disclaimer

Conflict of interest statement

Conflict of interest: The authors declare no conflict of interest.

Figures

Figure 1.
Figure 1.
Distribution of the big-molecule intrinsic aqueous solubility values in Wiki-pS0
Figure 2.
Figure 2.
Plot of log S0 versus octanol-water partition coefficient, clogP, calculated using the RDKit software [62]. Squares refer to big molecules; circles refer to small molecules
Figure 3.
Figure 3.
Big-molecule property distributions: (a) clogP, (b) molecular weight (MW), and (c) number of H-bond donors (NHD) and acceptors (NHA). The separation between the groups is greater than that found in small molecules [20].
Figure 4.
Figure 4.
The log P in the figure refers to calculated octanol-water partition coefficients, clogP. The solid diagonals are the identity lines, and the dashed lines refer to ±0.5 log deviations. The MPP pie charts refer to percentage of ‘correct’ prediction, based on absolute residuals being ≤ 0.5 log. The prediction of log S0 values of (a) small molecules and (b) big molecules using the classical General Solubility Equation (numeric compound labels are of paclitaxel analogs). (c) When using just the big-molecule data, the three constants in the GSE (Eq. 1) subjected to MLR analysis (cf., Eq. 6b) produce the modified GSE, which is valid only for molecules with MW > 800 Da. There are not enough big molecules in the Wiki pS0 database to test the predictiveness of Eq. (6b).
Figure 5.
Figure 5.
The prediction of log S0 values of (a) small molecules and (b) big molecules using the Abraham Solvation Equation (ABSOLV). (c) An additional nonlinear descriptor was added to the ABSOLV equation (cf., Eq. 5), which was then trained with the small-molecule set. This improved the prediction accuracy of the modified ABSOLV equation. The pie chart denotes MPP, the fraction of ‘correctly’ predicted molecules (absolute residuals ≤ 0.5 log unit).
Figure 6.
Figure 6.
Random Forest regression analysis. (a) Training set using the small molecules. (b) Internal validation test set, based on 30% of the small molecules randomly selected. (c) External test set prediction of big molecules, not used in the method training.
Figure 7.
Figure 7.
Logarithm of the Solubility Enhancement–Big Molecules as a function of the number of rotatable bonds: log SEBM = log S0Obs – log S0ABSOLV.
None

References

    1. Lipinski CA, Lombardo F, Dominy BW, Feeney PJ. Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. Adv Drug Deliv Rev. 46 (2001) 3–26. - PubMed
    1. Hörter D, Dressman JB. Influence of physicochemical properties on dissolution of drugs in the gastrointestinal tract. Adv Drug Deliv Rev. 46 (2001) 75–87. - PubMed
    1. Di L, Fish PV, Mano T. Bridging solubility between drug discovery and development. Drug Discov Today. 17 (2012) 486–95. - PubMed
    1. Bergström CAS, Holm R, Jørgensen SA, Andersson SBE, Artursson P, Beato S, et al. Early pharmaceutical profiling to predict oral drug absorption: Current status and unmet needs. Eur J Pharm Sci. 57 (2014) 173–99. - PubMed
    1. Dearden JC. In silico prediction of aqueous solubility. Expert Opin Drug Discov. 1 (2006) 31–52. - PubMed

LinkOut - more resources