Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2004 Jul 13;32(12):3689-702.
doi: 10.1093/nar/gkh694. Print 2004.

Genome-wide operon prediction in Staphylococcus aureus

Affiliations

Genome-wide operon prediction in Staphylococcus aureus

Liangsu Wang et al. Nucleic Acids Res. .

Abstract

Identification of operon structure is critical to understanding gene regulation and function, and pathogenesis, and for identifying targets towards the development of new antibiotics in bacteria. Recently, the complete genome sequences of a large number of important human bacterial pathogens have become available for computational analysis, including the major human Gram-positive pathogen Staphylococcus aureus. By annotating the predicted operon structure of the S.aureus genome, we hope to facilitate the exploration of the unique biology of this organism as well as the comparative genomics across a broad range of bacteria. We have integrated several operon prediction methods and developed a consensus approach to score the likelihood of each adjacent gene pair to be co-transcribed. Gene pairs were separated into distinct operons when scores were equal to or below an empirical threshold. Using this approach, we have generated a S.aureus genome map with scores annotated at the intersections of every adjacent gene pair. This approach predicted about 864 monocistronic transcripts and 533 polycistronic operons from the protein-encoding genes in the S.aureus strain Mu50 genome. When compared with a set of experimentally determined S.aureus operons from literature sources, this method successfully predicted at least 91% of gene pairs. At the transcription unit level, this approach correctly identified at least 92% of complete operons in this dataset. This consensus approach has enabled us to predict operons with high accuracy from a genome where limited experimental evidence for operon structure is available.

PubMed Disclaimer

Figures

Figure 1
Figure 1
A flow chart of operon annotations in S.aureus by integrating various algorithms.
Figure 2
Figure 2
Comparisons of frequency distance distributions of adjacent gene pairs in the same orientations in various bacterial genomes.
Figure 3
Figure 3
Distance distribution of S.aureus Rho-independent transcription terminators predicted by GCG Terminator software from Wisconsin Package™.
Figure 4
Figure 4
Comparison of nrdIEF operon across several bacterial organisms. Arrows represent the directions of the genes. A, nrdA; B, nrdB; I, nrdI; E, nrdE; F, nrdF. The double line represents the genome.

References

    1. Jacob F. and Monod,J. (1961) Genetic regulatory mechanisms in the synthesis of proteins. J. Mol. Biol., 3, 318–356. - PubMed
    1. Burton Z.F., Gross,C.A., Watanabe,K. and Burgess,R.R. (1983) The operon that encodes the sigma subunit of RNA polymerase also encodes ribosomal protein S21 and DNA primase in E.coli K12. Cell, 32, 335–349. - PubMed
    1. Little R., Fiil,N.P., Dennis,P.P. (1981) Transcriptional and post-transcriptional control of ribosomal protein and ribonucleic acid polymerase genes. J. Bacteriol., 147, 25–35. - PMC - PubMed
    1. Blattner F.R., Plunkett,G.,III, Bloch,C.A., Perna,N.T., Burland,V., Riley,M., Collado-Vides,J., Glasner,J.D., Rode,C.K., Mayhew,G.F. et al. (1997) The complete genome sequence of Escherichia coli K-12. Science, 277, 1453–1474. - PubMed
    1. Zurawski G., Zurawski,S.M. (1985) Structure of the Escherichia coli S10 ribosomal protein operon. Nucleic Acids Res., 13, 4521–4526. - PMC - PubMed

Publication types