Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2008 Oct 1;24(19):2149-56.
doi: 10.1093/bioinformatics/btn409. Epub 2008 Aug 1.

Protein-ligand interaction prediction: an improved chemogenomics approach

Affiliations

Protein-ligand interaction prediction: an improved chemogenomics approach

Laurent Jacob et al. Bioinformatics. .

Abstract

Motivation: Predicting interactions between small molecules and proteins is a crucial step to decipher many biological processes, and plays a critical role in drug discovery. When no detailed 3D structure of the protein target is available, ligand-based virtual screening allows the construction of predictive models by learning to discriminate known ligands from non-ligands. However, the accuracy of ligand-based models quickly degrades when the number of known ligands decreases, and in particular the approach is not applicable for orphan receptors with no known ligand.

Results: We propose a systematic method to predict ligand-protein interactions, even for targets with no known 3D structure and few or no known ligands. Following the recent chemogenomics trend, we adopt a cross-target view and attempt to screen the chemical space against whole families of proteins simultaneously. The lack of known ligand for a given target can then be compensated by the availability of known ligands for similar targets. We test this strategy on three important classes of drug targets, namely enzymes, G-protein-coupled receptors (GPCR) and ion channels, and report dramatic improvements in prediction accuracy over classical ligand-based virtual screening, in particular for targets with few or no known ligands.

Availability: All data and algorithms are available as Supplementary Material.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Distribution of the number of training points for a target for the enzymes, GPCR and ion channel datasets. Each bar indicates the proportion of targets in the family for which a given (x-axis) number of data points is available.
Fig. 2.
Fig. 2.
Target kernel Gram matrices (Ktar) for ion channels with multitask, hierarchy and local alignment kernels.
Fig. 3.
Fig. 3.
Relative improvement of the hierarchy kernel against the Dirac kernel as a function of the number of known ligands for enzymes, GPCR and ion channel datasets. Each point indicates the mean performance ratio between individual and hierarchy approaches across the targets of the family for which a given (x-axis) number of training points was available.

References

    1. Aronszajn N. Theory of reproducing kernels. Trans. Am. Math. Soc. 1950;68:337–404.
    1. Attwood TK, et al. Prints and its automatic supplement, preprints. Nucleic Acids Res. 2003;31:400–402. - PMC - PubMed
    1. Azencott C-A, et al. One- to four-dimensional kernels for virtual screening and the prediction of physical, chemical, and biological properties. J. Chem. Inf. Model. 2007;47:965–974. - PubMed
    1. Balakin KV, et al. Property-based design of GPCR-targeted library. J. Chem. Inf. Comput. Sci. 2002;42:1332–1342. - PubMed
    1. Ballesteros J, Palczewski K. G protein-coupled receptor drug discovery: implications from the crystal structure of rhodopsin. Curr. Opin. Drug Discov. Devel. 2001;4:561–574. - PMC - PubMed