Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Dec 8;11(12):e0168035.
doi: 10.1371/journal.pone.0168035. eCollection 2016.

Analysis of Domain Architecture and Phylogenetics of Family 2 Glycoside Hydrolases (GH2)

Affiliations

Analysis of Domain Architecture and Phylogenetics of Family 2 Glycoside Hydrolases (GH2)

David Talens-Perales et al. PLoS One. .

Abstract

In this work we report a detailed analysis of the topology and phylogenetics of family 2 glycoside hydrolases (GH2). We distinguish five topologies or domain architectures based on the presence and distribution of protein domains defined in Pfam and Interpro databases. All of them share a central TIM barrel (catalytic module) with two β-sandwich domains (non-catalytic) at the N-terminal end, but differ in the occurrence and nature of additional non-catalytic modules at the C-terminal region. Phylogenetic analysis was based on the sequence of the Pfam Glyco_hydro_2_C catalytic module present in most GH2 proteins. Our results led us to propose a model in which evolutionary diversity of GH2 enzymes is driven by the addition of different non-catalytic domains at the C-terminal region. This model accounts for the divergence of β-galactosidases from β-glucuronidases, the diversification of β-galactosidases with different transglycosylation specificities, and the emergence of bicistronic β-galactosidases. This study also allows the identification of groups of functionally uncharacterized protein sequences with potential biotechnological interest.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Domain architectures of GH2 family.
Numbers in parentheses indicate the number of sequences representative of each DA type, included in the Pfam database.
Fig 2
Fig 2. Domain architecture of DA type 5 sequences.
Colored and stripped boxes correspond to domains identified by Pfam and Interpro databases, respectively. Modules with more than 40% sequence identity compared to BIG1 domains identified by Interpro, with a coverage higher than 60%, were tagged as BIG1. Letters (a-i) on the right edge of the figure group sequences with similar DAs. Domain assignment at the I1, I2, and I3 regions (subtype a) was based on the analysis carried out with the β-galactosidase from S. pneumoniae [30]. Numbers on top of non-identified regions indicate approximate number of residues.
Fig 3
Fig 3. Phylogenetic analysis of the GH2C domain.
The tree was calculated by Maximum Likelihood method based on the JTT matrix-based model [21] condensed at < 50% bootstrap support. The analysis involved a selection of 380 amino acid sequences for the different DAs. The tree was drawn using FigTree software (http://tree.bio.ed.ac.uk/software/figtree/). Numbers indicate the DA type. The asterisk marks the subtree further analysed in Fig 4. Sectors corresponding to a single DA type are colored in turquoise (DA type 1), purple (DA type 2), blue (DA type 3), red (DA type 4). Cluster 5* is colored in light grey. Sectors including mixed DA types are colored in dark grey.
Fig 4
Fig 4. Phylogenetic subtree corresponding to the region marked 5* in Fig 3.
Letters on the right edge of the figure group sequences with similar DAs, as indicated in Fig 2. Branch numbers indicate bootstrap values. Bc and Bf correspond to Bacillus circulans and Bifidobacterium bifidum β-galactosidases, respectively.
Fig 5
Fig 5
Docking of a DA type 5 (A) and a DA type 3 (B) β-galactosidase with their main transglycosylation products. The figure shows the domains that compose the architecture of the enzymes, represented in different colors. The residues potentially interacting with β-D-(1,4)-galactosyl-lactose in Bacillus circulans β-galactosidase (A) or with β-D-(1,3)-galactosyl-lactose in Thermotoga maritima β-galactosidase (B) are highlighted on the right side.
Fig 6
Fig 6. Sequence alignment around the putative catalytic site of proteins analyzed in Fig 4.
Purple boxes indicate residues potentially involved in the active site, as predicted by docking analysis with β-D-(1,4)-galactosyl-lactose. Positions with more than 50% identity are colored in light blue and those with 100% identity are shown in dark blue color. Bc and Bf correspond to Bacillus circulans and Bifidobacterium bifidum β-galactosidases, respectively.
Fig 7
Fig 7. Proposed evolutionary model for GH2 enzymes.

References

    1. Adam AC, Rubio-Texeira M, Polaina J. Lactose: The milk sugar from a biotechnological perspective. Crit Rev Food Sci Nutr. 2004;44: 553–557. - PubMed
    1. Husain Q. Beta galactosidases and their potential applications: A review. Crit Rev Biotechnol. 2010;30: 41–62. 10.3109/07388550903330497 - DOI - PubMed
    1. Petzelbauer I, Reiter A, Splechtna B, Kosma P, Nidetzky B. Transgalactosylation by thermostable beta-glycosidases from Pyrococcus furiosus and Sulfolobus solfataricus. Binding interactions of nucleophiles with the galactosylated enzyme intermediate make major contributions to the formation of new beta-glycosides during lactose conversion. Eur J Biochem. 2000;267: 5055–66. - PubMed
    1. Talens-Perales D, Polaina J, Marin-Navarro J. Structural dissection of the active site of Thermotoga maritima beta-galactosidase identifies key residues for transglycosylating activity. J Agr Food Chem. 2016;64: 2917–24. - PubMed
    1. Torres DPM, Gonçalves MdPF, Teixeira J, Rodrigues LR. Galacto-oligosaccharides: Production, properties, applications, and significance as prebiotics. Compr Rev Food Sci F. 2010;9: 438–454. - PubMed

LinkOut - more resources