Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 May 5;17(5):e0268072.
doi: 10.1371/journal.pone.0268072. eCollection 2022.

High quality genome annotation and expression visualisation of a mupirocin-producing bacterium

Affiliations

High quality genome annotation and expression visualisation of a mupirocin-producing bacterium

Anthony S Haines et al. PLoS One. .

Abstract

Pseudomonas strain NCIMB10586, in the P. fluorescens subgroup, produces the polyketide antibiotic mupirocin, and has potential as a host for industrial production of a range of valuable products. To underpin further studies on its genetics and physiology, we have used a combination of standard and atypical approaches to achieve a quality of the genome sequence and annotation, above current standards for automated pathways. Assembly of Illumina reads to a PacBio genome sequence created a retrospectively hybrid assembly, identifying and fixing 415 sequencing errors which would otherwise affect almost 5% of annotated coding regions. Our annotation pipeline combined automation based on related well-annotated genomes and stringent, partially manual, tests for functional features. The strain was close to P. synxantha and P. libaniensis and was found to be highly similar to a strain being developed as a weed-pest control agent in Canada. Since mupirocin is a secondary metabolite whose production is switched on late in exponential phase, we carried out RNAseq analysis over an 18 h growth period and have developed a method to normalise RNAseq samples as a group, rather than pair-wise. To review such data we have developed an easily interpreted way to present the expression profiles across a region, or the whole genome at a glance. At the 2-hour granularity of our time-course, the mupirocin cluster increases in expression as an essentially uniform bloc, although the mupirocin resistance gene stands out as being expressed at all the time points.

PubMed Disclaimer

Conflict of interest statement

I have read the journal’s policy and the authors of this manuscript have the following competing interests: Authors specified as employees of GSK have competing interests because this work was part of a project to develop technology which would have value to GSK; all authors associated with the university of Birmingham have competing interests because the university of Birmingham would benefit financially if the joint project with GSK led to a process that was deemed to be profitable to GSK. This does not alter our adherence to PLOS ONE policies on sharing data and materials.

Figures

Fig 1
Fig 1. Summary of the pipeline used to achieve a high quality genome annotation.
Fig 2
Fig 2. Summary of non-conserved repeats in mmpD showing the sequence information available over the mupirocin cluster for several Pseudomonas strains.
The drawing is not to scale. A high-identity repeat is present covering methyltransferase domains (indicated in pink at top, labelled ‘MT’) of mmpD in NCIMB10586, 2P24 and 8B, and inferred to be present in BRG-100 and NS383, while sequence conservation is lower between the two regions in QS1027. The repeated sequence in each strain is shown as a box with coloured chevron; inferred repeats as a chevron alone. Neither the sequence, nor exact end-point location of these repeats is conserved between strains. Two apparent frameshifts in mmpD are indicated (‘!’) in 2P24, however the overall genome coverage is low and these may be artefacts.
Fig 3
Fig 3. Plot of tile expression intensity vs a measure of variation ([root-mean-square deviation]/mean expression) categorised as determined using the final normalisation factors.
For display in this graph only, expression intensities of zero have been substituted by one to permit calculation of the geometric mean.
Fig 4
Fig 4. Visualisation of expression over the NCIMB10586 genome.
A) Key. The colour of a feature depends on time-point of highest expression (hue), expression intensity (lightness; x axis) and ratio of maximum to minimum expression (saturation; y axis). So: white = consistent low expression; grey = high expression, little variation; intense colour = high expression and big variation between highest and lowest level. B) Features are shown above or below the DNA line dependent on the coding strand. Protein-coding regions are shown as coloured boxes slightly separated from the DNA line. RNA features are shown as coloured boxes adjacent to the DNA line. Putative clusters identified by antiSMASH are shown: grey bar–neighbourhood; black bar(s)—protocluster regions.
Fig 5
Fig 5. Enlarged view of expression over each cluster in the NCIMB10586 genome identified by antiSMASH.
A, B, C, E, G - NRPS; D - Betalactone; F - NAGGN; H - siderophore; I, L - bacteriocin; J – NRPS-like; K - Arylpolyene; M – mupirocin cluster.

Similar articles

Cited by

References

    1. Katz L, Baltz RH. Natural product discovery: past, present, and future. J Ind Microbiol & Biotech 2016; 43:155–176. - PubMed
    1. Fuller AT, Mellows G, Woolford M, Banks GT, Barrow KD, Chain EB. Pseudomonic Acid: an antibiotic produced by Pseudomonas fluorescens. Nature 1971; 234:416–417. doi: 10.1038/234416a0 - DOI - PubMed
    1. Chain EB, Mellows G. Structure of pseudomonic acid, an antibiotic from Pseudomonas fluorescens. J Chem Soc-Chem Comm 1974; 20:847–848.
    1. Curzons AD. Isolation of Pseudomonic Acid. US Patent 1978; 4639534.
    1. Whatling CA, Hodgson JE, Burnham MKR, Clarke NJ, Franklin FCH, Thomas CM. Identification of a 60 kb region of the chromosome of Pseudomonas fluorescens NCIB 10586 required for their biosynthesis of pseudomonic acid (mupirocin). Microbiology 1995; 141:973–982.

Publication types

MeSH terms