Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2005 Nov;1(5):e62.
doi: 10.1371/journal.pgen.0010062. Epub 2005 Nov 18.

Evidence of a large novel gene pool associated with prokaryotic genomic islands

Affiliations

Evidence of a large novel gene pool associated with prokaryotic genomic islands

William W L Hsiao et al. PLoS Genet. 2005 Nov.

Abstract

Microbial genes that are "novel" (no detectable homologs in other species) have become of increasing interest as environmental sampling suggests that there are many more such novel genes in yet-to-be-cultured microorganisms. By analyzing known microbial genomic islands and prophages, we developed criteria for systematic identification of putative genomic islands (clusters of genes of probable horizontal origin in a prokaryotic genome) in 63 prokaryotic genomes, and then characterized the distribution of novel genes and other features. All but a few of the genomes examined contained significantly higher proportions of novel genes in their predicted genomic islands compared with the rest of their genome (Paired t test = 4.43E-14 to 1.27E-18, depending on method). Moreover, the reverse observation (i.e., higher proportions of novel genes outside of islands) never reached statistical significance in any organism examined. We show that this higher proportion of novel genes in predicted genomic islands is not due to less accurate gene prediction in genomic island regions, but likely reflects a genuine increase in novel genes in these regions for both bacteria and archaea. This represents the first comprehensive analysis of novel genes in prokaryotic genomic islands and provides clues regarding the origin of novel genes. Our collective results imply that there are different gene pools associated with recently horizontally transmitted genomic regions versus regions that are primarily vertically inherited. Moreover, there are more novel genes within the gene pool associated with genomic islands. Since genomic islands are frequently associated with a particular microbial adaptation, such as antibiotic resistance, pathogen virulence, or metal resistance, this suggests that microbes may have access to a larger "arsenal" of novel genes for adaptation than previously thought.

PubMed Disclaimer

Conflict of interest statement

Competing interests. The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. Proportion of Novel Genes in Genomic Islands (Red Bars) versus the Rest of the Genome (Blue Bars) according to a COG-Based Analysis
Proportions of novel genes are calculated as a percentage of all genes within islands or outside of islands, respectively, for each genome (listed on the x axis). A paired t test indicates that significantly more genes in islands versus non-islands do not have a COG classification (p = 1.20E-18). This phenomenon is uniform across prokaryotic lineages and domains. Similar results are also observed if different datasets are analyzed, or different methods for identifying novel genes are used (Table 3).
Figure 2
Figure 2. Proportion of Novel Genes in Genomic Islands (Red Bars) versus the Rest of the Genome (Blue Bars) according to a SUPERFAMILY-Based Analysis
Proportions of novel genes are calculated as a percentage of all genes within islands or outside of islands, respectively, for each genome (listed on the x axis). A paired t test indicates that significantly higher proportions of genes in islands (red bars) versus outside islands (non-islands; purple bars) do not have a SUPERFAMILY prediction (potential novel genes; p = 4.43E-14).

References

    1. Fleischmann RD, Adams MD, White O, Clayton RA, Kirkness EF, et al. Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. Science. 1995;269:496–512. - PubMed
    1. Kunst F, Ogasawara N, Moszer I, Albertini AM, Alloni G, et al. The complete genome sequence of the gram-positive bacterium Bacillus subtilis . Nature. 1997;390:249–256. - PubMed
    1. Blattner FR, Plunkett G, 3rd, Bloch CA, Perna NT, Burland V, et al. The complete genome sequence of Escherichia coli K-12. Science. 1997;277:1453–1474. - PubMed
    1. Bork P. Powers and pitfalls in sequence analysis: The 70% hurdle. Genome Res. 2000;10:398–400. - PubMed
    1. Siew N, Fischer D. Analysis of singleton ORFans in fully sequenced microbial genomes. Proteins. 2003;53:241–251. - PubMed

Publication types