Nucleic Acids Res. 2004 Apr 30;32(8):2353-61.
doi: 10.1093/nar/gkh555. Print 2004.

Identification and functional analysis of 'hypothetical' genes expressed in Haemophilus influenzae

Eugene Kolker et al. Nucleic Acids Res. 2004.

Abstract

Progress in genome sequencing has led to a rapid accumulation of uncharacterized 'hypothetical' genes in GenBank submissions. These genes, which have not been experimentally characterized and whose functions cannot be deduced from simple sequence comparisons alone, now comprise a significant fraction of the public databases. Expression analyses of Haemophilus influenzae cells using a combination of transcriptomic and proteomic approaches resulted in the confident identification of 54 'hypothetical' genes that were expressed in cells under normal growth conditions. To understand the functions of these proteins, we used a variety of publicly available analysis tools. Close homologs in other species were detected for each of the 54 'hypothetical' genes. For 16 of them, exact functional assignments could be found in one or more public databases. Additionally, we were able to suggest a general functional characterization for 27 more genes, bringing the total to 43 of the 54 genes (approximately 80%). Findings from this analysis include the identification of a pyruvate-formate lyase-like operon, likely to be expressed not only in H. influenzae but also in several other bacteria. We also observed three genes that are likely to participate in the transport and/or metabolism of sialic acid, an important component of the H. influenzae lipo-oligosaccharide. Accurate functional annotation of uncharacterized genes calls for an integrative approach, combining expression studies with extensive computational analysis and curation, followed by eventual experimental verification of the computational predictions.

Figures

Figure 1
Sequence alignment of HI0521 and related proteins with pyruvate-formate lyases.
