Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Feb 9;2(2):e000043.
doi: 10.1099/mgen.0.000043. eCollection 2016 Feb.

Polysaccharide utilization loci and nutritional specialization in a dominant group of butyrate-producing human colonic Firmicutes

Affiliations

Polysaccharide utilization loci and nutritional specialization in a dominant group of butyrate-producing human colonic Firmicutes

Paul O Sheridan et al. Microb Genom. .

Abstract

Firmicutes and Bacteroidetes are the predominant bacterial phyla colonizing the healthy human large intestine. Whilst both ferment dietary fibre, genes responsible for this important activity have been analysed only in the Bacteroidetes, with very little known about the Firmicutes. This work investigates the carbohydrate-active enzymes (CAZymes) in a group of Firmicutes, Roseburia spp. and Eubacterium rectale, which play an important role in producing butyrate from dietary carbohydrates and in health maintenance. Genome sequences of 11 strains representing E. rectale and four Roseburia spp. were analysed for carbohydrate-active genes. Following assembly into a pan-genome, core, variable and unique genes were identified. The 1840 CAZyme genes identified in the pan-genome were assigned to 538 orthologous groups, of which only 26 were present in all strains, indicating considerable inter-strain variability. This analysis was used to categorize the 11 strains into four carbohydrate utilization ecotypes (CUEs), which were shown to correspond to utilization of different carbohydrates for growth. Many glycoside hydrolase genes were found linked to genes encoding oligosaccharide transporters and regulatory elements in the genomes of Roseburia spp. and E. rectale, forming distinct polysaccharide utilization loci (PULs). Whilst PULs are also a common feature in Bacteroidetes, key differences were noted in these Firmicutes, including the absence of close homologues of Bacteroides polysaccharide utilization genes, hence we refer to Gram-positive PULs (gpPULs). Most CAZyme genes in the Roseburia/E. rectale group are organized into gpPULs. Variation in gpPULs can explain the high degree of nutritional specialization at the species level within this group.

Keywords: Carbohydrate; Lachnospiraceae; Roseburia; comparative genomics; gut microbiota; obligate anaerobe.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Growth of Roseburia/E. rectale strains on selected carbohydrates in microtitre plates. (a) Heatmap representing the mean maximum OD650 obtained by a strain during growth on a specific carbohydrate (OD650 0.0–1.6). Growth was observed on fructo-oligosaccharide (FOS), galacto-oligosaccharide (GOS), xylo-oligosaccharide (XOS), amylopectin (AP), amylose (A), 1,3–1,4-β-glucan (B-glu), arabinoxylan (AX), type 1 arabinogalactan (AG1) and inulin (I). No growth was observed for any the 11 strains on β-mannan, xyloglucan, type 2 arabinogalactan, mucin core type 2 or mucin core type 3. Data plotted in graphs are the mean ± sd OD650 readings of six replicates of strains grown on (b) 0.5 % arabinoxylan or (c) 0.5 % type 1 arabinogalactan. Full growth data are presented in Table S3.
Fig. 2.
Fig. 2.
Phylogenetic tree of Roseburia/E. rectale GH13s. Gene names are colour-coded based on KEGG GH annotation. Strongly supported clades (bootstrap ≥ 90) are coloured at their most proximal branch. The branch colour corresponds to the KEGG GH annotation of the genes within the clade. Colour coding is as follows: neopullulanase (EC 3.2.1.135) (light blue), cyclomaltodextrinase (EC 3.2.1.54) (orange), α-glucosidase (EC 3.2.1.3) (brown), α-amylase (EC 3.2.1.1) (red), sucrose phosphorase (EC 3.2.1.7) (pink), oligo-1,6-glucosidase (EC 3.2.1.10) (blue), pullulanase (EC 3.2.1.41) (green), glycogen-debranching enzyme (EC 3.2.1.-) (purple) and malto-oligosyltrehalose trehalohydrolase (EC 3.2.1.141) (gold). Bootstrap values, expressed as a percentage of 1000 replications, are given at the branching nodes. This tree is unrooted and reconstructed using the maximum-likelihood method. The scale bar refers to the number of amino acid differences per position. Clades of core GH13s are indicated by asterisks at their most proximal branch.
Fig. 3.
Fig. 3.
(a) Principal coordinate analysis of Roseburia/E. rectale strains based on complement of GH families and (b) heatmap showing CUEs. Values of a given GH family or carbohydrate set were taken as the number of these genes each genome possessed. In (a), coordinates were calculated using Kendal τ distance applied to the first five eigenvectors. R. intestinalis strains L1-82, M50/1 and XB6B4 (orange); R. inulinivorans strains A2-194 and L1-83 (blue); R. faecis M72/1 and R. hominis A2-183 (green); and E. rectale strains A1-86, T1-815, M104/1 and ATCC33656 (red) form separate clusters (P < 0.001, non-parametric multivariate ANOVA). In (b), CUEs were determined by complete linkage clustering using Kendall τ (as shown) and Spearman distance (not shown). GH53, an endo-1,4-β-galactanase that cleaves the β-1,4-d-galactosidic linkages in type I arabinogalactans, was assigned to the carbohydrate set ‘Type 1 arabinogalactan’ and is excluded from ‘Xylans and Arabinans’, ‘Pectins’ and ‘Alpha- and Beta-galactosides’. CUE1 consists of R. faecis M72/1 and R. hominis A2-183. CUE2 consists of R. intestinalis strains L1-82, M50/1 and XB6B4. CUE3 consists of E. rectale strains A1-86, T1-815, M104/1 and ATCC33656. CUE4 consists of R. inulinivorans strains A2-194 and L1-83. GH assignment to each carbohydrate set is described in Table S6.
Fig. 4.
Fig. 4.
Schematic representation of gpPULs concerned with xylan and arabinogalactan utilization. Glycoside hydrolase genes are coloured green. Carbohydrate esterase genes are coloured blue. ABC-transporter system component genes are coloured red. Transcriptional regulator genes are coloured yellow. Uncharacterized transporter genes are coloured black. Hypothetical genes are coloured white. Xylose isomerase genes are coloured grey. Two parallel black bars between genes indicate sections that are separated in the genome sequence. Roseburia/E. rectale strains not represented in the diagram lack an orthologous gpPUL. Genes located vertically to each other are orthologues. Solid blue lines between genes are for easy visual comparison of the genes between species and do not represent real gaps in the genome. Locus tags of gpPULs are listed in Table S7.
Fig. 5.
Fig. 5.
Schematic representation of gpPULs concerned with fructan utilization. GH genes are coloured green. ABC transporter system component genes are coloured red. Transcriptional regulator genes are coloured yellow. Fructokinase genes are coloured purple. Hypothetical genes are coloured white. Any of the 11 Roseburia/E. rectale strains not represented in the diagram lack an orthologous gpPUL. Genes located vertically to each other are orthologues. The diagonal line through the R. faecis gene represents a frameshift mutation. Locus tags of gpPULs are listed in Table S7.
Fig. 6.
Fig. 6.
Schematic representation of gpPULs concerned with host-derived carbohydrates. GH genes are coloured green. ABC transporter system component genes are coloured red. The polysaccharide lyase gene is coloured bright yellow. Mucin desulphatase genes are coloured gold. Hypothetical genes are coloured white. Two-component signal transduction component genes consisting of a histidine kinase and a response regulator containing a CheY-like receiver domain and an AraC-like DNA-binding domain are coloured navy. Solid blue lines between genes are for easy visual comparison of the genes between species and do not represent real gaps in the genome. Any of the 11 Roseburia/E. rectale strains not represented in the diagram lack an orthologous gpPUL. Locus tags of gpPULs are listed in Table S7.

References

    1. Alikhan N. F., Petty N. K., Ben Zakour N. L., Beatson S. A. (2011). blast Ring Image Generator (brig): simple prokaryote genome comparisons BMC Genomics 12402. 10.1186/1471-2164-12-402. - DOI - PMC - PubMed
    1. Aminov R. I., Walker A. W., Duncan S. H., Harmsen H. J., Welling G. W., Flint H. J. (2006). Molecular diversity, cultivation, and improved detection by fluorescent in situ hybridization of a dominant group of human gut bacteria related to Roseburia spp. or Eubacterium rectale Appl Environ Microbiol 726371–6376. - PMC - PubMed
    1. Balamurugan R., Rajendiran E., George S., Samuel G. V., Ramakrishna B. S. (2008). Real-time polymerase chain reaction quantification of specific butyrate-producing bacteria, Desulfovibrio and Enterococcus faecalis in the feces of patients with colorectal cancer J Gastroenterol Hepatol 231298–1303. - PubMed
    1. Barcenilla A., Pryde S. E., Martin J. C., Duncan S. H., Stewart C. S., Henderson C., Flint H. J. (2000). Phylogenetic relationships of butyrate-producing bacteria from the human gut Appl Environ Microbiol 661654–1661 10.1128/AEM.66.4.1654-1661.2000. - DOI - PMC - PubMed
    1. Ben David Y., Dassa B., Borovok I., Lamed R., Koropatkin N. M., Martens E. C., White B. A., Bernalier-Donadille A., Duncan S. H., other authors (2015). Ruminococcal cellulosome systems from rumen to human Environ Microbiol 173407–3426 10.1111/1462-2920.12868. - DOI - PubMed

Data References

    1. Eubacterium rectale T1-815 genome (2015); CVRQ01000001–CVRQ01000090: http://www.ebi.ac.uk/ena/data/view/PRJEB9320.
    1. Roseburia faecis M72/1 genome (2015); CVRR01000001–CVRR01000101: http://www.ebi.ac.uk/ena/data/view/PRJEB9321.
    1. Roseburia inulinivorans L1-83 genome (2015); CVRS01000001–CVRS01000151: http://www.ebi.ac.uk/ena/data/view/PRJEB9322.

Publication types