Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2008 Feb 28:9:105.
doi: 10.1186/1471-2164-9-105.

MADIBA: a web server toolkit for biological interpretation of Plasmodium and plant gene clusters

Affiliations

MADIBA: a web server toolkit for biological interpretation of Plasmodium and plant gene clusters

Philip J Law et al. BMC Genomics. .

Abstract

Background: Microarray technology makes it possible to identify changes in gene expression of an organism, under various conditions. Data mining is thus essential for deducing significant biological information such as the identification of new biological mechanisms or putative drug targets. While many algorithms and software have been developed for analysing gene expression, the extraction of relevant information from experimental data is still a substantial challenge, requiring significant time and skill.

Description: MADIBA (MicroArray Data Interface for Biological Annotation) facilitates the assignment of biological meaning to gene expression clusters by automating the post-processing stage. A relational database has been designed to store the data from gene to pathway for Plasmodium, rice and Arabidopsis. Tools within the web interface allow rapid analyses for the identification of the Gene Ontology terms relevant to each cluster; visualising the metabolic pathways where the genes are implicated, their genomic localisations, putative common transcriptional regulatory elements in the upstream sequences, and an analysis specific to the organism being studied.

Conclusion: MADIBA is an integrated, online tool that will assist researchers in interpreting their results and understand the meaning of the co-expression of a cluster of genes. Functionality of MADIBA was validated by analysing a number of gene clusters from several published experiments - expression profiling of the Plasmodium life cycle, and salt stress treatments of Arabidopsis and rice. In most of the cases, the same conclusions found by the authors were quickly and easily obtained after analysing the gene clusters with MADIBA.

PubMed Disclaimer

Figures

Figure 1
Figure 1
A schematic representation of the flow of data through MADIBA. After a microarray experiment, data are normalised and then clustered, since it is hypothesised that the genes in a cluster have common biological implications. A cluster of genes is submitted to MADIBA, either as nucleotide sequences, or gene identifiers. This list of genes can then be subjected to five analysis modules – Gene Ontology Analysis, Metabolic Pathways Analysis, Transcription Regulation Analysis, Chromosomal Localisation Analysis and an Organism Specific Analysis. Also shown are the data that are required by each of the analysis modules. The results from the analyses can be exported as a PDF file, or as plain text.
Figure 2
Figure 2
Results from the Metabolic Pathways module. Analysis of cluster 6 of the Plasmodium data [40] revealed that it was noticeably involved in glycolysis. In the KEGG map for glycolysis, it could be seen that almost all of the enzymes involved are present in the cluster. Additionally, all of the enzymes were annotated by all the three annotation sources – the curated annotation from the original data source (PlasmoDB); the semi-automatic KEGG annotation and the automatic PRIAM annotation, as indicated by the yellow boxes.
Figure 3
Figure 3
Results from the Gene Ontology module. An analysis of the biological process ontology of the cluster 6 of the Plasmodium data [40] revealed that anaerobic glycolysis was the most significant term. The DAG was reduced to show only the terms that are most relevant to glucose metabolism. The grey ellipses contain the genes that are annotated to the connected GO term and the colour of the GO terms indicates different levels of significance, as indicated by the legend.
Figure 4
Figure 4
Results from the Arabidopsis data. (A) Analysis of cluster 0 from the Arabidopsis salt stress experiment [42] with the Metabolic Pathways module revealed that the cluster contained genes involved in lignin biosynthesis. The red colour indicates that the annotations were found by two annotation methods (PRIAM and KEGG in this case), and the purple indicates the enzyme was annotated by PRIAM only. (B) After analysing cluster 8 of the Arabidopsis data [42] with the Transcription Regulation module, it was possible to identify putative transcription factor binding sites. The output of the oligo-analysis tool of RSAT is shown, indicating two motifs on the reverse complement that were identified as similar to the WRKY binding site ((C/T)TGAC(T/C)) (highlighted in the red box). Cluster 8 is known to contain several WRKY transcription factors and several disease-resistance genes. (C) Output from the Patch program of the TRANSFAC sub-module. Shown is the PR-1a (a pathogenesis related protein) promoter binding site that was identified. The table headers are provided for convenience.

Similar articles

Cited by

References

    1. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM, Sherlock G. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000;25:25–29. doi: 10.1038/75556. - DOI - PMC - PubMed
    1. Kanehisa M, Goto S, Kawashima S, Okuno Y, Hattori M. The KEGG resource for deciphering the genome. Nucleic Acids Res. 2004;32:D277–D280. doi: 10.1093/nar/gkh063. - DOI - PMC - PubMed
    1. Matys V, Fricke E, Geffers R, Gossling E, Haubrock M, Hehl R, Hornischer K, Karas D, Kel AE, Kel-Margoulis OV, Kloos DU, Land S, Lewicki-Potapov B, Michael H, Munch R, Reuter I, Rotert S, Saxel H, Scheer M, Thiele S, Wingender E. TRANSFAC(R): transcriptional regulation, from patterns to profiles. Nucl Acids Res. 2003;31:374–378. doi: 10.1093/nar/gkg108. - DOI - PMC - PubMed
    1. Al Shahrour F, Diaz-Uriarte R, Dopazo J. FatiGO: a web tool for finding significant associations of Gene Ontology terms with groups of genes. Bioinformatics. 2004;20:578–580. doi: 10.1093/bioinformatics/btg455. - DOI - PubMed
    1. Lenhard B, Hayes WS, Wasserman WW. GeneLynx: A Gene-Centric Portal to the Human Genome. Genome Res. 2001;11:2151–2157. doi: 10.1101/gr.199801. - DOI - PMC - PubMed

Publication types

MeSH terms

LinkOut - more resources