Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Mar 3;11(3):716-26.
doi: 10.1021/mp400450m. Epub 2014 Feb 18.

ADMET evaluation in drug discovery. 13. Development of in silico prediction models for P-glycoprotein substrates

Affiliations

ADMET evaluation in drug discovery. 13. Development of in silico prediction models for P-glycoprotein substrates

Dan Li et al. Mol Pharm. .

Abstract

P-glycoprotein (P-gp) actively transports a wide variety of chemically diverse compounds out of cells. It is highly associated with the ADMET properties of drugs and drug candidates and, moreover, plays a major role in the multidrug resistance (MDR) phenomenon, which leads to the failure of chemotherapy in cancer treatments. Therefore, the recognition of potential P-gp substrates at the early stages of the drug discovery process is quite important. Here, we compiled an extensive data set containing 423 P-gp substrates and 399 nonsubstrates, which is the largest P-gp substrate/nonsubstrate data set yet published. Comparison of the distributions of eight important physicochemical properties for the substrates and nonsubstrates reveals that molecular weight and molecular solubility are the informative attributes differentiating P-gp substrates from nonsubstrates. Examination of the distributions of eight physicochemical properties for 735 P-gp inhibitors and 423 substrates gives the fact that inhibitors are significantly more hydrophobic than substrates while substrates tend to have more H-bond donors than inhibitors. Then, the classification models based on simple molecular properties, topological descriptors, and molecular fingerprints were developed using the naive Bayesian classification technique. The best naive Bayesian classifier yields a Matthews correlation coefficient of 0.824 and a prediction accuracy of 91.2% for the training set from a 5-fold cross-validation procedure, and a Matthews correlation coefficient of 0.667 and a prediction accuracy of 83.5% for the test set containing 200 molecules. Analysis of the important structural fragments given by the Bayesian classifier shows that the essential H-bond acceptors arranged in distinct spatial patterns and flexibility are quite essential for P-gp substrate-likeness, which affords a deeper understanding on the molecular basis of substrate/P-gp interaction. Finally, the reasons for mispredictions were discussed. It turns out that the presented classifier could be used as a reliable virtual screening tool for identifying potential substrates of P-gp.

Keywords: ADME; ADMET; P-glycoprotein; fingerprint; naive Bayesian classification; substrates.

PubMed Disclaimer

Publication types

MeSH terms

Substances

LinkOut - more resources