Computational pipeline for sustainable enzyme discovery through (re)use of metagenomic data
- PMID: 40252419
- DOI: 10.1016/j.jenvman.2025.125381
Computational pipeline for sustainable enzyme discovery through (re)use of metagenomic data
Abstract
Enzymes derived from extremophilic organisms, also known as extremozymes, offer sustainable and efficient solutions for industrial applications. Valued for their resilience and low environmental impact, extremozymes have found use as catalysts in various processes, ranging from dairy production to pharmaceutical manufacturing. However, discovery of novel extremozymes is often hindered by challenges such as culturing difficulties, underrepresentation of extreme environments in reference databases, and limitations of traditional sequence-based screening methods. In this work, we present a computational pipeline designed to discover novel enzymes from metagenomic data derived from extreme environments. This pipeline represents a versatile and sustainable approach that promotes reuse and recycling of existing datasets and minimises the need for additional environmental sampling. In its core, the algorithm integrates both traditional bioinformatic techniques and recent advances in structural prediction, enabling rapid and accurate identification of enzymes. However, due to its design, the algorithm relies heavily on existing databases, which can limit its effectiveness in situations where reference data is scarce or when encountering novel protein families. As a proof-of-concept, we applied the pipeline to metagenomic data from deep-sea hydrothermal vents, with a focus on β-galactosidases. The pipeline identified 11 potential candidate proteins, out of which 10 showed in vitro activity. One of the selected enzymes, βGal_UW07, showed strong potential for industrial applications. The enzyme exhibited optimal activity at 70 °C and was exceptionally resistant to high pH and the presence of metal ions and reducing agents. Overall, our results indicate that the pipeline is highly accurate and can play a key role in sustainable bioprospecting, leveraging existing metagenomic datasets and minimising in situ interventions in pristine regions.
Keywords: Arctic; Bioprospecting; Extremophile; Hydrothermal vent; Protein structure; β-galactosidase.
Copyright © 2025 The Authors. Published by Elsevier Ltd.. All rights reserved.
Conflict of interest statement
Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Similar articles
-
Cold and Hot Extremozymes: Industrial Relevance and Current Trends.Front Bioeng Biotechnol. 2015 Oct 20;3:148. doi: 10.3389/fbioe.2015.00148. eCollection 2015. Front Bioeng Biotechnol. 2015. PMID: 26539430 Free PMC article. Review.
-
Discovery of novel enzymes with industrial potential from a cold and alkaline environment by a combination of functional metagenomics and culturing.Microb Cell Fact. 2014 May 20;13:72. doi: 10.1186/1475-2859-13-72. Microb Cell Fact. 2014. PMID: 24886068 Free PMC article.
-
Bioprospecting of Novel Extremozymes From Prokaryotes-The Advent of Culture-Independent Methods.Front Microbiol. 2021 Feb 10;12:630013. doi: 10.3389/fmicb.2021.630013. eCollection 2021. Front Microbiol. 2021. PMID: 33643258 Free PMC article. Review.
-
A community-supported metaproteomic pipeline for improving peptide identifications in hydrothermal vent microbiota.Brief Bioinform. 2021 Sep 2;22(5):bbab052. doi: 10.1093/bib/bbab052. Brief Bioinform. 2021. PMID: 33834201
-
Going to extremes - a metagenomic journey into the dark matter of life.FEMS Microbiol Lett. 2021 Jun 24;368(12):fnab067. doi: 10.1093/femsle/fnab067. FEMS Microbiol Lett. 2021. PMID: 34114607
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Research Materials