BMC Genomics. 2012 Sep 9;13:465.
doi: 10.1186/1471-2164-13-465.

Complete genome sequence of Saccharothrix espanaensis DSM 44229(T) and comparison to the other completely sequenced Pseudonocardiaceae

Tina Strobel et al. BMC Genomics. 2012.

Abstract

Background: The genus Saccharothrix is a representative of the family Pseudonocardiaceae, known to include producer strains of a wide variety of potent antibiotics. Saccharothrix espanaensis produces both saccharomicins A and B of the promising new class of heptadecaglycoside antibiotics, active against both bacteria and yeast.

Results: To better assess its capabilities, the complete genome sequence of S. espanaensis was established. With a size of 9,360,653 bp, coding for 8,501 genes, it stands alongside other Pseudonocardiaceae with large genomes. Besides a predicted core genome of 810 genes shared in the family, S. espanaensis has a large number of accessory genes: 2,967 singletons when compared to the family, of which 1,292 have no clear orthologs in the RefSeq database. The genome analysis revealed the presence of 26 biosynthetic gene clusters potentially encoding secondary metabolites. Among them, the cluster coding for the saccharomicins could be identified.

Conclusion: S. espanaensis is the first completely sequenced species of the genus Saccharothrix. The genome discloses the cluster responsible for the biosynthesis of the saccharomicins, the largest oligosaccharide antibiotics currently identified. Moreover, the genome revealed 25 additional putative secondary metabolite gene clusters, further suggesting the strain's potential for natural product synthesis.

Figures

Figure 1
Schematic representation of the S. espanaensis genome. The genome scale is given in kilobases from the start of dnaA. The two outermost circles show all genes on the forward and the reverse strand, respectively, color-coded according to their predicted COG classes. The next five circles represent the genes of S. espanaensis, color-coded according to their conservation in the genomes of the other completely sequenced Pseudonocardiaceae. Green denotes genes present in the core genome, red those conserved in at least the two species being compared, and light blue indicates singletons. The comparison with S. espanaensis was done (from the outside in) with A. mirum, A. mediterranei, S. erythraea, S. viridis and T. bispora. The last two circles represent G + C content and G + C skew ((G-C)/(G + C)), both calculated for a 500 bp window with 100 bp stepping.
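The window statistics named in this caption can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the function name `gc_windows` is hypothetical, the sequence is treated as linear (windows that would wrap around the circular chromosome are simply skipped), and a window with no G or C is assigned a skew of 0.0 by assumption.

```python
def gc_windows(seq, window=500, step=100):
    """Yield (start, gc_content, gc_skew) for each full window of seq.

    gc_content = (G + C) / window length
    gc_skew    = (G - C) / (G + C), as given in the caption
    """
    seq = seq.upper()
    for start in range(0, len(seq) - window + 1, step):
        chunk = seq[start:start + window]
        g = chunk.count("G")
        c = chunk.count("C")
        gc_content = (g + c) / window
        # Assumption: a window without any G or C gets a skew of 0.0.
        gc_skew = (g - c) / (g + c) if (g + c) else 0.0
        yield start, gc_content, gc_skew


# Toy sequence: 500 G followed by 500 C, so the skew falls from +1 to -1.
toy = "G" * 500 + "C" * 500
for start, content, skew in gc_windows(toy):
    print(start, round(content, 3), round(skew, 3))
```

The defaults mirror the 500 bp window and 100 bp step stated in the caption; any other values are up to the caller.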
Figure 2
Phylogenetic distribution of S. espanaensis proteins based on BlastP hits against the RefSeq database. The amino acid sequences of all predicted CDS in the genome of S. espanaensis were compared against the RefSeq protein database [22] (from August 2011) using BLASTP. The species for each best hit (e-value cutoff 1e-10, hit must cover at least 75% of query and subject) was retrieved and the results were plotted from the least to the most abundantly hit group in the respective taxonomic level. For reasons of clarity, groups with few hits were either lumped together (e.g. under "Other Bacteria") or omitted entirely.
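The best-hit selection described in this caption might look like the sketch below. The `Hit` record and its field names are hypothetical, coverage is assumed to mean aligned residues divided by sequence length, the e-value cutoff is assumed to be inclusive, and ties between surviving hits are assumed to be broken by bit score; the caption does not specify any of these details.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    """One BLASTP alignment (hypothetical record layout)."""
    query: str
    subject: str
    evalue: float
    bitscore: float
    q_aligned: int  # query residues covered by the alignment
    q_len: int      # full query length
    s_aligned: int  # subject residues covered by the alignment
    s_len: int      # full subject length


def passes(hit, max_evalue=1e-10, min_cov=0.75):
    """Apply the caption's thresholds: e-value and coverage of both sequences."""
    return (hit.evalue <= max_evalue
            and hit.q_aligned / hit.q_len >= min_cov
            and hit.s_aligned / hit.s_len >= min_cov)


def best_hits(hits):
    """Return {query: best surviving hit}; 'best' is assumed to be highest bit score."""
    best = {}
    for hit in hits:
        if not passes(hit):
            continue
        current = best.get(hit.query)
        if current is None or hit.bitscore > current.bitscore:
            best[hit.query] = hit
    return best
```

The species of each retained subject would then be looked up and tallied per taxonomic level, which is outside the scope of this sketch.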
Figure 3
Whole genome comparison of S. espanaensis. To analyze gene synteny, the amino acid sequences of all predicted CDS in the genome of S. espanaensis were compared against those of (A) A. mirum (red) and S. erythraea (green) as well as A. mediterranei (purple) and S. viridis (orange) using the bidirectional BLAST comparison implemented in EDGAR. Aligning all genomes at dnaA, the position of each potential ortholog was then plotted against the position in the S. espanaensis genome. In order to accommodate different genome sizes, the relative position is used for the target genomes.
Figure 4
Development of the core genome of the Pseudonocardiaceae. Using EDGAR, the development of the core genome of the Pseudonocardiaceae was extrapolated by calculating the mean core genome numbers for all possible permutations of genomes (red crosses/line). By non-linear least squares curve fitting, an exponential decay function (dark blue curve and equation) was fitted to the mean core data. A 95% confidence interval was calculated for the fitted model, and the boundaries are displayed (light blue and purple curves). Using the genomes of A. mirum, A. mediterranei, P. dioxanivorans, S. erythraea, S. espanaensis, S. viridis, and T. bispora, a final core genome of approximately 810 genes is predicted, with the current core of the seven analyzed species consisting of 864 genes.
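The first step in this caption, averaging the core genome size over all orderings of the genomes, can be illustrated with a small sketch. It is not EDGAR's implementation: the function name `mean_core_sizes` is hypothetical, and orthology is reduced to shared gene-family identifiers in plain Python sets, which is an assumption made only for illustration. The subsequent exponential decay fit is omitted here.

```python
from itertools import permutations


def mean_core_sizes(genomes):
    """Mean core genome size after adding 1, 2, ..., n genomes.

    genomes maps a genome name to a set of gene-family identifiers.
    The core after k genomes is the intersection of the first k sets
    in a given ordering; the mean is taken over all orderings.
    """
    names = list(genomes)
    totals = [0] * len(names)
    n_orders = 0
    for order in permutations(names):
        core = set(genomes[order[0]])
        for i, name in enumerate(order):
            core &= genomes[name]
            totals[i] += len(core)
        n_orders += 1
    return [total / n_orders for total in totals]


# Toy data: three genomes sharing gene families 1 and 2.
toy = {"A": {1, 2, 3, 4}, "B": {1, 2, 3, 5}, "C": {1, 2, 6}}
print(mean_core_sizes(toy))
```

Enumerating every ordering grows factorially, which is manageable for the seven genomes listed in the caption (5,040 orderings) but not for large collections.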
Figure 5
Principal component analysis of the dinucleotide frequencies of the S. espanaensis CDS. (A) Using EDGAR, all CDS from S. espanaensis were divided into three groups: "core" (conserved in all six completely sequenced Pseudonocardiaceae; blue "*"), "other" (shared between S. espanaensis and at least one other Pseudonocardiaceae species; green "x") and singletons ("unique" in S. espanaensis; red " + "). For all genes the relative dinucleotide frequencies were calculated, a PCA was performed using the R package and the results for the two main components are plotted. In addition, the median values for all three distributions were calculated and plotted. (B) Using the same calculation as in A, the genes were divided in relation to their position in the genome relative to the origin of replication. Genes close to the oriC (corresponding to the "top half" of the genome) are given as red "x", genes closer to the terminus ("bottom half" of the genome) are depicted as green " + ". Median points are denoted as black "*" and " + ", green and black circles mark the 90% boundaries.
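The per-gene input to such a PCA could be computed as in the sketch below. It assumes that "relative dinucleotide frequency" means the share of each of the 16 dinucleotides among all overlapping dinucleotides in a CDS, which the caption does not spell out; the function name is hypothetical and the PCA step itself is not reproduced.

```python
from itertools import product

# The 16 dinucleotides in a fixed order: AA, AC, AG, AT, CA, ...
DINUCLEOTIDES = ["".join(pair) for pair in product("ACGT", repeat=2)]


def dinucleotide_frequencies(seq):
    """Return a 16-element vector of relative dinucleotide frequencies.

    Overlapping dinucleotides are counted; pairs containing characters
    other than A, C, G or T (for example N) are ignored.
    """
    seq = seq.upper()
    counts = dict.fromkeys(DINUCLEOTIDES, 0)
    total = 0
    for i in range(len(seq) - 1):
        pair = seq[i:i + 2]
        if pair in counts:
            counts[pair] += 1
            total += 1
    return [counts[d] / total if total else 0.0 for d in DINUCLEOTIDES]


# Each CDS yields one row of the matrix that a PCA would take as input.
vector = dinucleotide_frequencies("ATGGCGCGCTGA")
print(dict(zip(DINUCLEOTIDES, vector)))
```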
Figure 6
The saccharomicin gene cluster from S. espanaensis. (A) Chemical structures of caffeic acid, taurine, saccharomicin A and B. Fuc, d-fucose; Sac, d-saccharosamine; Rha, l-rhamnose; Eva, l-4-epivancosamine; Dig, l-digitoxose [6]. (B) Organization of the saccharomicin cluster. Proposed functions for individual CDS are summarized in Additional file 1.
Figure 7
S. espanaensis gene clusters for nonribosomal peptide and polyketide biosynthesis. Genes encoding nonribosomal peptide synthases are depicted in dark blue, type I polyketide synthases in red and type II polyketide synthases in orange. The genes involved in the synthesis of putative precursors are highlighted in light green. The remaining genes of the clusters are presented in pale blue. All genes involved in the biosynthesis of an enediyne core in cluster 6 are framed brown. aao, l-amino-acid oxidase; abc, ABC transporter; acc, acyl-CoA carboxylase; acd, acyl-CoA dehydrogenase; acp, acyl carrier protein; acs, acyl-CoA synthetase; act, acyl-CoA transferase; amo, amine oxidase; amt, aminotransferase; ap, aminopeptidase; ask, adenylylsulfate kinase; asl, AMP-dependent synthetase and ligase; ass, sulfate adenylyltransferase; at, acyl transferase; bh, beta-hydroxylase; cbs, carbamoyltransferase; cd, cysteine desulfurase; cho, cholesterol oxidase; cl, chlorinating protein; clf, chain length factor; ct, carboxyltransferase; cys, cysteine synthase; dbp, DNA-binding protein; dc, decarboxylase; dgb, glyoxalase/bleomycin resistance protein/dioxygenase; dh, dehydratase; dhbas, protein involved in the synthesis of activated 2,3-dihydroxybenzoic acid; dhg, dehydrogenase; e/l, esterase/lipase; eci, enoyl-CoA hydratase/isomerase; eff, efflux protein; gsit, glutamine–scyllo-inositol transaminase; gt, glycosyltransferase; hal, histidine ammonia-lyase; hmacps, protein involved in the synthesis of hydroxymalonyl-ACP; hmbppr, 4-hydroxy-3-methylbut-2-enyl diphosphate reductase; hmg, hydroxymethylglutaryl-CoA synthase; hpah, 4-hydroxyphenylacetate-3-hydroxylase; hyd, hydrolase; int, integrase; kr, ketoreductase; ks II, FabF-like protein; ks III, FabH-like protein; lam, lysine 2,3-aminomutase like protein; llp, lipolytic protein; lys, protein involved in lysine synthesis via alpha-aminoadipate; mfs, transporter of the major facilitator superfamily; mmcd, methylmalonyl-CoA decarboxylase; mo, monooxygenase; mt, methyltransferase; mtr, methionyl-tRNA synthetase; npd, 2-nitropropane dioxygenase; ocd, ornithine cyclodeaminase; oxy, oxidoreductase; p450, cytochrome P450; phas, polyhydroxy alkanoic acid synthase; pkc, polyketide cyclase; ppph, 2-polyprenylphenol 6-hydroxylase; pro, protease; reg, regulatory protein; rsam, radical SAM protein; sarp, streptomyces antibiotic regulatory protein; sip, siderophore-interacting protein; tcd, taurine catabolism dioxygenase; te, thioesterase; tetr, protein similar to the tetracycline repressor; tk, transketolase; tn, transposase.

References

    1. Demain AL, Adrio JL. Contributions of microorganisms to industrial biology. Mol Biotechnol. 2008;38:41–55. doi: 10.1007/s12033-007-0035-z.
    2. Baltz RH. Renaissance in antibacterial discovery from actinomycetes. Curr Opin Pharmacol. 2008;8:557–563. doi: 10.1016/j.coph.2008.04.008.
    3. Zitouni A, Boudjella H, Mathieu F, Sabaou N, Lebrihi A. Mutactimycin PR, a new anthracycline antibiotic from Saccharothrix sp. SA 103. I. Taxonomy, fermentation, isolation and biological activities. J Antibiot (Tokyo). 2004;57:367–372. doi: 10.7164/antibiotics.57.367.
    4. Ohuchi T, Ikeda-Araki A, Watanabe-Sakamoto A, Kojiri K, Nagashima M, Okanishi M, Suda H. Cloning and expression of a gene encoding N-glycosyltransferase (ngt) from Saccarothrix aerocolonigenes ATCC39243. J Antibiot (Tokyo). 2000;53:393–403. doi: 10.7164/antibiotics.53.393.
    5. Labeda DP, Kroppenstedt RM. Phylogenetic analysis of Saccharothrix and related taxa: proposal for Actinosynnemataceae fam. nov. Int J Syst Bacteriol. 2000;50:331–336.