Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2005 May 18;33(9):2822-37.
doi: 10.1093/nar/gki573. Print 2005.

Prediction of functional modules based on comparative genome analysis and Gene Ontology application

Affiliations
Comparative Study

Prediction of functional modules based on comparative genome analysis and Gene Ontology application

Hongwei Wu et al. Nucleic Acids Res. .

Abstract

We present a computational method for the prediction of functional modules encoded in microbial genomes. In this work, we have also developed a formal measure to quantify the degree of consistency between the predicted and the known modules, and have carried out statistical significance analysis of consistency measures. We first evaluate the functional relationship between two genes from three different perspectives--phylogenetic profile analysis, gene neighborhood analysis and Gene Ontology assignments. We then combine the three different sources of information in the framework of Bayesian inference, and we use the combined information to measure the strength of gene functional relationship. Finally, we apply a threshold-based method to predict functional modules. By applying this method to Escherichia coli K12, we have predicted 185 functional modules. Our predictions are highly consistent with the previously known functional modules in E.coli. The application results have demonstrated that our approach is highly promising for the prediction of functional modules encoded in a microbial genome.

PubMed Disclaimer

Figures

Figure 1
Figure 1
The directed acyclic graph induced from the GO term UDP-N-acetylgalactosamine biosynthesis (GO: 0019277), wherein at the bottommost level is the GO term of interest itself, and at the upper levels are all its ancestors, adapted from QuickGO Go Browser ().
Figure 2
Figure 2
Distribution of SGO(gi, gj) for the positive (circles) and the random (triangles) sets.
Figure 3
Figure 3
Distribution of d(gi, gj) for the positive (blue) and the random (red) sets.
Figure 4
Figure 4
Distribution of SN(gi, gj) for the positive (blue) and the random (red) sets.
Figure 5
Figure 5
The normalized Combined(gi, gj) of both naive Bayesian (red) and Bayesian (blue) inference approaches for both the positive (solid) and the random (dashed) sets.
Figure 6
Figure 6
(a) The number of edges as a function of α; and (b) AHMD values for the known pathways, regulons and operons as functions of α.
Figure 7
Figure 7
Predicted modules consisting of at least three genes obtained by using α = 6.75, where edges of red represent for belonging to the same known pathways, edges of blue represent for belonging to the same known regulons, edges of green represent for belonging to the same known operons, edges of orange represent for transporter unit, edges of purple represent for having similar GO assignments and edges of black represent for having not been experimentally verified.
Figure 8
Figure 8
Predicted module corresponding to the flagellar metabolism pathway, where according to Eco Cyc the genes of blue belong to the same regulon, and the genes of yellow, green and dark red belong to three different operons, respectively.
Figure 9
Figure 9
Predicted modules involving more than one pathways.
Figure 10
Figure 10
Genes are connected mainly because they are conserved neighboring genes along the same strand of DNA.
Figure 11
Figure 11
Predicted modules that have not been experimentally verified.
Figure 12
Figure 12
Predicted modules that hemL (16128147) is involved for different values of α: (a) α = 6.75, (b) α = 5.5 and (c) α = 4.
Figure 12
Figure 12
Predicted modules that hemL (16128147) is involved for different values of α: (a) α = 6.75, (b) α = 5.5 and (c) α = 4.

Similar articles

Cited by

References

    1. Kremling A., Jahreis K., Lengeler J.W., Gilles E.D. The organization of metabolic reaction networks: a signal-oriented approach to cellular models. Metab. Eng. 2000;2:190–200. - PubMed
    1. Wagner R. Transcription Regulation in Prokaryotes. Oxford, UK: Oxford University Press; 2000.
    1. Stephanopoulos G.N., Aristidou A.A., Nielsen J. Metabolic Engineering: Principles and Methodologies. San Diego, CA: Academic Press; 1998.
    1. Zhou J., Thompson D.K., Xu Y., Tiedje J.M. Microbial Functional Genomics. Hoboken, NJ: John Wiley & Sons, Inc.; 2004.
    1. Gene Ontology Consortium. Creating the gene ontology resource: design and implementation. Genome Res. 2001;11:1425–1433. - PMC - PubMed

Publication types