Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Oct 21;2(10):e000079.
doi: 10.1099/mgen.0.000079. eCollection 2016 Oct.

Evolutionary interpretations of mycobacteriophage biodiversity and host-range through the analysis of codon usage bias

Affiliations

Evolutionary interpretations of mycobacteriophage biodiversity and host-range through the analysis of codon usage bias

Lauren A Esposito et al. Microb Genom. .

Abstract

In an genomics course sponsored by the Howard Hughes Medical Institute (HHMI), undergraduate students have isolated and sequenced the genomes of more than 1,150 mycobacteriophages, creating the largest database of sequenced bacteriophages able to infect a single host, Mycobacterium smegmatis, a soil bacterium. Genomic analysis indicates that these mycobacteriophages can be grouped into 26 clusters based on genetic similarity. These clusters span a continuum of genetic diversity, with extensive genomic mosaicism among phages in different clusters. However, little is known regarding the primary hosts of these mycobacteriophages in their natural habitats, nor of their broader host ranges. As such, it is possible that the primary host of many newly isolated mycobacteriophages is not M. smegmatis, but instead a range of closely related bacterial species. However, determining mycobacteriophage host range presents difficulties associated with mycobacterial cultivability, pathogenicity and growth. Another way to gain insight into mycobacteriophage host range and ecology is through bioinformatic analysis of their genomic sequences. To this end, we examined the correlations between the codon usage biases of 199 different mycobacteriophages and those of several fully sequenced mycobacterial species in order to gain insight into the natural host range of these mycobacteriophages. We find that UPGMA clustering tends to match, but not consistently, clustering by shared nucleotide sequence identify. In addition, analysis of GC content, tRNA usage and correlations between mycobacteriophage and mycobacterial codon usage bias suggests that the preferred host of many clustered mycobacteriophages is not M. smegmatis but other, as yet unknown, members of the mycobacteria complex or closely allied bacterial species.

Keywords: UPGMA clustering; bacteriophages; codon bias; host range; mycobacteria; viral tRNAs.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Histogram of tRNA-encoding genes found in the 199 mycobacteriophages analyzed in this study.
Fig. 2.
Fig. 2.
The synonymous codon usage order (SCUO) was calculated for each genome. SCUO values for each mycobacteriophage were compared with the quantity of encoded tRNA genes. tRNA abundance does not predict SCUO. However, if a phage does have tRNA genes, it tends to have a more biased genome.
Fig. 3.
Fig. 3.
Relative synonymous codon usage (RSCU) was calculated for each mycobacteriophage and mycobacterial species and used to reconstruct this UPGMA-dendrogram. Colors correspond to the cluster designations assigned by phagesdb.org.

References

    1. Andersson G. E., Sharp P. M.(1996). Codon usage in the Mycobacterium tuberculosis complex. Microbiol 142915–925.10.1099/00221287-142-4-915 - DOI - PubMed
    1. Bahir I., Fromer M., Prat Y., Linial M.(2009). Viral adaptation to host: a proteome-based analysis of codon usage and amino acid preferences. Mol Syst Biol 5311.10.1038/msb.2009.71 - DOI - PMC - PubMed
    1. Bailly-Bechet M., Vergassola M., Rocha E.(2007). Causes for the intriguing presence of tRNAs in phages. Genome Res 171486–1495.10.1101/gr.6649807 - DOI - PMC - PubMed
    1. Basra S., Anany H., Brovko L., Kropinski A. M., Griffiths M. W.(2014). Isolation and characterization of a novel bacteriophage against Mycobacterium avium subspecies paratuberculosis. Arch Virol 1592659–2674.10.1007/s00705-014-2122-3 - DOI - PubMed
    1. Bentley S. D., Parkhill J.(2004). Comparative genomic structure of prokaryotes. Annu Rev Genet 38771–792.10.1146/annurev.genet.38.072902.094318 - DOI - PubMed

Data Bibliography

    1. Alferez, G. I., Bryan, W. J., Byington, E. L., & other authors . NCBI GenBank, http://www.ncbi.nlm.nih.gov/nuccore/JF704105 - GenBank Accession #: JF70...
    1. Bambawale, V., Bieberich, J. C., Borowski, A. L., & other authors . NCBI GenBank, http://www.ncbi.nlm.nih.gov/nuccore/JF704116 - GenBank Accession #: JF70...
    1. Brosch, R., Gordon, S. V., Garnier, T., & other authors. Genome plasticity of BCG and impact on vaccine efficacy. Proc Natl Acad Sci USA 104, 5596-5601. GenBank Accession #: NC_008769 (2007). - PMC - PubMed
    1. Cole, S. T., Brosch, R., Parkhill, J., & other authors. Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence. Nature 393, 537-544. GenBank Accession #: NC_000962 (2001). - PubMed
    1. Cole, S. T., Eiglmeier, K., Parkhill, J., & other authors. Massive gene decay in the leprosy bacillus. Nature 409, 1007-1011. GenBank Accession #: NC_002677 (2001). - PubMed

Publication types

LinkOut - more resources