Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2007;8(8):R176.
doi: 10.1186/gb-2007-8-8-r176.

Distribution patterns of small-molecule ligands in the protein universe and implications for origin of life and drug discovery

Affiliations

Distribution patterns of small-molecule ligands in the protein universe and implications for origin of life and drug discovery

Hong-Fang Ji et al. Genome Biol. 2007.

Abstract

Background: Extant life depends greatly on the binding of small molecules (such as ligands) with macromolecules (such as proteins), and one ligand can bind multiple proteins. However, little is known about the global patterns of ligand-protein mapping.

Results: By examining 2,186 well-defined small-molecule ligands and thousands of protein domains derived from a database of druggable binding sites, we show that a few ligands bind tens of protein domains or folds, whereas most ligands bind only one, which indicates that ligand-protein mapping follows a power law. Through assigning the protein-binding orders (early or late) for bio-ligands, we demonstrate that the preferential attachment principle still holds for the power-law relation between ligands and proteins. We also found that polar molecular surface area, H-bond acceptor counts, H-bond donor counts and partition coefficient are potential factors to discriminate ligands from ordinary molecules and to differentiate super ligands (shared by three or more folds) from others.

Conclusion: These findings have significant implications for evolution and drug discovery. First, the chronology of ligand-protein binding can be inferred by the power-law feature of ligand-protein mapping. Some nucleotide-containing ligands, such as ATP, ADP, GDP, NAD, FAD, dihydro-nicotinamide-adenine-dinucleotide phosphate (NDP), nicotinamide-adenine-dinucleotide phosphate (NAP), flavin mononucleotide (FMN) and AMP, are found to be the earliest cofactors bound to proteins, agreeing with the current understanding of evolutionary history. Second, the finding that about 30% of ligands are shared by two or more domains will help with drug discovery, such as in finding new functions from old drugs, developing promiscuous drugs and depending more on natural products.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Power-law behaviors of ligand-protein binding. The number of ligands (N) decays with an increase in the number (L) of (a) domains and (b) folds that bind the ligand and follows the equation N = aL-b. The figure illustrates that a few ligands cover tens of protein domains or folds, while most ligands bind only one domain or fold.
Figure 2
Figure 2
Chemical space consisting of ligands (derived from sc-PDB) and ordinary molecules (randomly selected from ACD-SC), defined by the first two factors derived from 70 descriptors. The figure illustrates that ligands (in red) occupy the relatively upper part of the space, while ordinary molecules (in blue) occupy the relatively lower part, which means that it is the second factor that discriminates ligands from ordinary molecules. From the loadings of the second factor, it can be deduced that polar molecular surface area, H-bond donor counts, H-bond acceptor counts and partition coefficient are likely responsible for the differences between ligands and ordinary molecules, which is supported by the different average values of the four kinds of parameters for ligands and ordinary molecules (Table 2).
Figure 3
Figure 3
Chemical space consisting of sc-PDB-derived ligands, defined by the first two factors derived from 70 descriptors. The figure illustrates that super ligands (shared by 3+ folds; in blue), especially those that are shared by 10+ folds (in red), concentrate in the relatively upper part of the space (the area of the circle is directly proportional to the number of folds that bind the ligand), which suggests that polar molecular surface area, H-bond donor counts, H-bond acceptor counts and partition coefficient are responsible for the strong protein-binding potential of the super ligands, which is supported by the different average values of the four kinds of parameters for ligands with different protein-binding potentials (Table 2).

Similar articles

Cited by

References

    1. Barabási AL, Oltvai ZN. Network biology: understanding the cell's functional organization. Nat Rev Genet. 2004;5:101–113. doi: 10.1038/nrg1272. - DOI - PubMed
    1. Fewell JH. Social insect networks. Science. 2003;301:1867–1870. doi: 10.1126/science.1088945. - DOI - PubMed
    1. Whittaker RH, Feeny PP. Allelochemics: chemical interactions between species. Science. 1971;171:757–770. doi: 10.1126/science.171.3973.757. - DOI - PubMed
    1. Dixon RA. Natural products and plant disease resistance. Nature. 2001;411:843–847. doi: 10.1038/35081178. - DOI - PubMed
    1. Camilli A, Bassler BL. Bacterial small-molecule signaling pathways. Science. 2006;311:1113–1116. doi: 10.1126/science.1121357. - DOI - PMC - PubMed

Publication types