Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Nov 27;5(4):1121-31.
doi: 10.1016/j.celrep.2013.10.031. Epub 2013 Nov 21.

Genome-wide mapping of transcriptional start sites defines an extensive leaderless transcriptome in Mycobacterium tuberculosis

Affiliations

Genome-wide mapping of transcriptional start sites defines an extensive leaderless transcriptome in Mycobacterium tuberculosis

Teresa Cortes et al. Cell Rep. .

Erratum in

  • Cell Rep. 2014 Jan 30;6(2):415

Abstract

Deciphering physiological changes that mediate transition of Mycobacterium tuberculosis between replicating and nonreplicating states is essential to understanding how the pathogen can persist in an individual host for decades. We have combined RNA sequencing (RNA-seq) of 5' triphosphate-enriched libraries with regular RNA-seq to characterize the architecture and expression of M. tuberculosis promoters. We identified over 4,000 transcriptional start sites (TSSs). Strikingly, for 26% of the genes with a primary TSS, the site of transcriptional initiation overlapped with the annotated start codon, generating leaderless transcripts lacking a 5' UTR and, hence, the Shine-Dalgarno sequence commonly used to initiate ribosomal engagement in eubacteria. Genes encoding proteins with active growth functions were markedly depleted from the leaderless transcriptome, and there was a significant increase in the overall representation of leaderless mRNAs in a starvation model of growth arrest. The high percentage of leaderless genes may have particular importance in the physiology of nonreplicating M. tuberculosis.

PubMed Disclaimer

Figures

None
Graphical abstract
Figure 1
Figure 1
Mapping of TSSs in M. tuberculosis (A) Genome visualization shows the transcriptional map of M. tuberculosis. Moving from the outer to innermost ring, forward genes are indicated in green and reverse genes in purple; TSS mapping is in orange, and whole-transcriptome expression is in light blue. Circular map was generated using Circos (Krzywinski et al., 2009). (B) The 50 bp upstream sequences of the 1,779 primary TSSs detected were used for motif discovery using MEME. A conserved −10 sequence with consensus TANNNT was found in 73% of the upstream promoter sequences; 7% of these had an extended −10 motif of TGNTANNNT. (C) The extended −10 motif (TGNTANNNT) was associated with maximal promoter activity measured by TSS peak height. Box plots indicate median (horizontal line), interquartile range (box), and minimum and maximum values (whiskers). Statistically significant differences are indicated for p values of p < 0.05, ∗∗p < 0.01, and ∗∗∗p < 0.001. (D) Base preference at the transcription initiation start point is presented. The percent representation of each base is shown for positions −1, +1, and +2 among the 1,779 primary TSSs. See also Figure S1 and Table S1.
Figure 2
Figure 2
Leaderless Genes Are Expressed at a Lower Level during Exponential Growth (A) and (C) show box plots indicating median (horizontal line), interquartile range (box), and minimum and maximum values (whiskers). Statistically significant differences are indicated (∗∗∗p < 0.001). (A) RNA levels (RPKM values) during exponential growth across leaderless (L), Shine-Dalgarno (SD), and UTR genes (identified on the basis of a primary TSS or operon organization) are shown. See also Figure S2 for a similar analysis using more stringent criteria for identification of Shine-Dalgarno genes. (B) Correlation between promoter expression (primary TSS strength) and the associated downstream gene (RPKM values) for leaderless (orange) and leader (gray) genes is shown. (C) Protein levels (measured as ion counts) during exponential growth across leaderless, Shine-Dalgarno, and UTR genes are shown. y Axis represents ion counts rescaled by dividing by 106 in order to reduce axis values. (D) Correlation between protein abundance and mRNA expression is presented. The plot shows an overlay of genes encoded on leaderless (orange), Shine-Dalgarno (blue), and UTR (gray) transcripts. Plots for individual transcript categories are included in Figure S2. See also Figure S2 and Tables S1 and S4.
Figure 3
Figure 3
Leaderless and Shine-Dalgarno Transcripts Are Differentially Represented among Different Functional Gene Classes The distribution of genes encoded by leaderless and Shine-Dalgarno transcripts across different functional classes was compared among the expressed transcriptome representation. Values on the x axis represent a difference in percentage; positive values indicate overrepresentation of a particular functional class with respect to whole-transcriptome representation, whereas negative values indicate underrepresentation. All functional categories shown were statistically significant after chi-square test analyses, and patterned bars further denote the functional categories that remained significant after multiple testing correction. ND, no difference. See also Table S1.
Figure 4
Figure 4
Differential Expression of Leaderless mRNAs in Response to Starvation (A) There was a significant increase in the median level of expression of leaderless mRNAs after 24 hr starvation measured by RPKM for whole genes. (B)–(D) show Artemis traces illustrating TSS mapping for genes strongly upregulated in the starvation model. Bars record the normalized number of mapped reads; the maximum normalized read count is indicated on top of each bar. The position of the TSS corresponds to the left-hand edge of the bar for transcripts in the forward orientation (shown in blue) and to the right-hand edge for transcripts in the reverse orientation (red). In each panel, the top trace is from exponential growth, and the lower trace shows TSS profiles after 24 hr starvation. The genome location is shown at the bottom of each panel. (B) Transcription of sigE is driven by three TSSs in exponential phase; only the middle TSS—generating a leaderless mRNA—is upregulated in response to starvation. (C) Upregulation of lat (Rv3289c) is accompanied by upregulation of an antisense transcript to lrpA (Rv3290c). Closely juxtaposed divergent promoters are a common feature of the M. tuberculosis transcriptome. (D) PrpD (Rv1130) and its adjacent divergently oriented regulator (Rv1129c) are both upregulated in response to starvation; insets show mapping of TSS to Met 26 and Val 9 start codons, generating leaderless mRNAs. (E) Bar charts indicate the percentage of proteins with abundance ratios (starvation/exponential growth) greater than 1.1 and less than 0.9 for leaderless, Shine-Dalgarno, and UTR categories. ns, not significant. See also Tables S1, S4, S5, and S6.

References

    1. Agarwal N., Tyagi A.K. Role of 5′-TGN-3′ motif in the interaction of mycobacterial RNA polymerase with a promoter of ‘extended -10’ class. FEMS Microbiol. Lett. 2003;225:75–83. - PubMed
    1. Albrethsen J., Agner J., Piersma S.R., Højrup P., Pham T.V., Weldingh K., Jimenez C.R., Andersen P., Rosenkrands I. Proteomic profiling of Mycobacterium tuberculosis identifies nutrient-starvation-responsive toxin-antitoxin systems. Mol. Cell. Proteomics. 2013;12:1180–1191. - PMC - PubMed
    1. Anders S., Huber W. Differential expression analysis for sequence count data. Genome Biol. 2010;11:R106. - PMC - PubMed
    1. Arnvig K., Young D. Non-coding RNA and its potential role in Mycobacterium tuberculosis pathogenesis. RNA Biol. 2012;9:427–436. - PMC - PubMed
    1. Arnvig K.B., Comas I., Thomson N.R., Houghton J., Boshoff H.I., Croucher N.J., Rose G., Perkins T.T., Parkhill J., Dougan G., Young D.B. Sequence-based analysis uncovers an abundance of non-coding RNA in the total transcriptome of Mycobacterium tuberculosis. PLoS Pathog. 2011;7:e1002342. - PMC - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources