Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2006 Nov 17:5:110.
doi: 10.1186/1475-2875-5-110.

Integration and mining of malaria molecular, functional and pharmacological data: how far are we from a chemogenomic knowledge space?

Affiliations
Review

Integration and mining of malaria molecular, functional and pharmacological data: how far are we from a chemogenomic knowledge space?

Lyn-Marie Birkholtz et al. Malar J. .

Abstract

The organization and mining of malaria genomic and post-genomic data is important to significantly increase the knowledge of the biology of its causative agents, and is motivated, on a longer term, by the necessity to predict and characterize new biological targets and new drugs. Biological targets are sought in a biological space designed from the genomic data from Plasmodium falciparum, but using also the millions of genomic data from other species. Drug candidates are sought in a chemical space containing the millions of small molecules stored in public and private chemolibraries. Data management should, therefore, be as reliable and versatile as possible. In this context, five aspects of the organization and mining of malaria genomic and post-genomic data were examined: 1) the comparison of protein sequences including compositionally atypical malaria sequences, 2) the high throughput reconstruction of molecular phylogenies, 3) the representation of biological processes, particularly metabolic pathways, 4) the versatile methods to integrate genomic data, biological representations and functional profiling obtained from X-omic experiments after drug treatments and 5) the determination and prediction of protein structures and their molecular docking with drug candidate structures. Recent progress towards a grid-enabled chemogenomic knowledge space is discussed.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Malaria functional genomics (X-omics) strategies in the context of target and drug characterization. Selected questions that could be addressed by the application of functional genomics are listed, including those specific to the transcriptome, proteome or interactome (X-omes). Highlighted is the particular focus on the application of this type of strategies to drug(s) and target(s) characterization.
Figure 2
Figure 2
Current pipeline for the homology-based modelling of malaria protein 3D-structures. This scheme emphasizes on the currently available methods to overcome amino acid bias, low sequence identity, protein inserts etc. Future upgrades include the refinement of each of these methods, for instance implementing asymmetric substitution matrices discussed in the text, that take into account the different amino acid distributions of malarial and non-malarial proteins for pairwise alignments.
Figure 3
Figure 3
In silico screening for protein ligands based on structural docking. A urea analog inhibiting malaria plasmepsins was identified with good score from the first WISDOM (World-wide In Silico Docking On Malaria) campaign. The WISDOM initiative successfully deployed large scale in silico docking on the European public EGEE grid infrastructure. The ligand shown here docks inside the binding pocket of plasmepsin, and interacts with key protein residues. The developed and established protocols can be used for new targets, and particularly a second computing challenge, WISDOM II.
Figure 4
Figure 4
Malaria chemogenomics: organization and treatment of genomic, post-genomic and chemical information for the prediction and characterization of target and drug candidates. Genomic data from Plasmodium and other species (a), i.e. protein sequences, should be organized based on sequence similarity (b). This projection should allow the high throughput reconstruction of molecular phylogenies both at the intraspecific (connecting paralogs and alleles) and interspecific (connecting homologues among which orthologs) levels following statistically accurate methods e.g. the TULIP method. Another substantial side of the biological space is designed by representing the knowledge of the biological processes, using stable ontologies e.g. the GO, and dynamic graph representation, e.g. PlasmoCyc (c). Versatile tools should allow the integration of genomic data, biological process representations and global functional profiles obtained with diverse X-omic approaches (4). These tools should comply with the large diversity of technologies and mining methods. The collection of information on the biological response to drugs is one of the doors to connect the biological space with the chemical space, following the "reverse chemical genetic" way, i.e. "from known drugs to biological response" (toxicity, mode of action). The other door to connect the chemical space and the biological space follows the "direct chemical genetic" way, i.e. "from known biological target to drug candidates". In addition to malaria protein structures obtained from crystals, the automated structural annotation of the malaria proteome should be initiated with quality scores (e). Based on protein structure information, virtual docking campaigns such as the WISDOM challenges can be achieved using the power of computer grids. The in silico organization of the small molecules stored in chemolibraries (f) follows similar principles, in particular the determination of three-dimensional structures of small molecules (g) and a clustering of small molecular structures based on drug properties and descriptors (h). Sharing and mining of chemogenomic information, completed with knowledge harvested in unstructured scientific literature, would benefit of the advances in knowledge space design and deployment on knowledge grids.

References

    1. World Malaria Report 2005. Geneva, World Health Organization, WHO/UNICEF. 2005.
    1. Grover-Kopec EK, Blumenthal MB, Ceccato P, Dinku T, Omumbo JA, Connor SJ. Web-based climate information resources for malaria control in Africa. Malar J. 2006;5:38. - PMC - PubMed
    1. Adl SM, Simpson AG, Farmer MA, Andersen RA, Anderson OR, Barta JR, Bowser SS, Brugerolle G, Fensome RA, Fredericq S, James TY, Karpov S, Kugrens P, Krug J, Lane CE, Lewis LA, Lodge J, Lynn DH, Mann DG, McCourt RM, Mendoza L, Moestrup O, Mozley-Standridge SE, Nerad TA, Shearer CA, Smirnov AV, Spiegel FW, Taylor MF. The new higher level classification of eukaryotes with emphasis on the taxonomy of protists. J Eukaryot Microbiol. 2005;52:399–351. - PubMed
    1. Desowitz RS. Malaria: from quinine to the vaccine. Hosp Pract. 1992;27:209–14. 217–24, 229–32. - PubMed
    1. Utzinger J, Tanner M, Kammen DM, Killeen GF, Singer BH. Integrated program is key to malarial control. Nature. 2002;419:431. - PubMed

Publication types

LinkOut - more resources