Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jun 2;41(6):btaf307.
doi: 10.1093/bioinformatics/btaf307.

ASMC: investigating the amino acid diversity of enzyme active sites

Affiliations

ASMC: investigating the amino acid diversity of enzyme active sites

Thomas Bailly et al. Bioinformatics. .

Abstract

Motivation: The analysis of enzyme active sites is essential for understanding their activity in terms of catalyzed reaction and substrate specificity, providing insights for engineering to obtain targeted properties or modify the substrate scope. In 2010, a first version of the Active Site Modeling and Clustering (ASMC) workflow was published. ASMC predicts isofunctional clusters from enzyme families, based on structural modeling and clustering of active sites. Since then, structure- and sequence-based methods have developed considerably.

Results: We present here a redesign of the ASMC workflow. This new major version includes recent pocket prediction, structural alignment and clustering methods, as well as a refined amino acid distance matrix, thereby improving the relevance of results and reducing the need for laborious manual analysis to obtain relevant clusters. In addition, we have implemented multiple sequence alignment as a possible input for the clustering step, along with an additional script to compare 2D and 3D active sites. Finally, the code has been unified from three to one programming language (Python) to facilitate its installation and maintenance. This new version of ASMC was evaluated on a set of protein families, resulting in overall better performances compared to its original version.

Availability and implementation: ASMC is supported on Linux operating system and freely available at https://github.com/labgem/ASMC, along with a complete documentation (wiki, tutorial).

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Schematic view of the updated ASMC workflow. Each colored frame represents a way to run ASMC: (A) whether the active site(s) is/are unknown or (B) known, (C) whether the 3D models are already available, (D) whether users need to re-cluster or sub-cluster a set of aligned active sites, or (E) whether the input is a multiple sequence alignment (2D-based approach). Software names are written in bold.

Similar articles

References

    1. Bastard K, Isabet T, Stura EA et al. Structural studies based on two lysine dioxygenases with distinct regioselectivity brings insights into enzyme specificity within the clavaminate synthase-like family. Sci Rep 2018;8:16587. - PMC - PubMed
    1. Bastard K, Perret A, Mariage A et al. Parallel evolution of non-homologous isofunctional enzymes in methionine biosynthesis. Nat Chem Biol 2017;13:858–66. - PubMed
    1. Bastard K, Smith AAT, Vergne-Vaxelaire C et al. Revealing the hidden functional diversity of an enzyme family. Nat Chem Biol 2014;10:42–9. - PubMed
    1. Brown DP, Krishnamurthy N, Sjölander K et al. Automated protein subfamily identification and classification. PLoS Comput Biol 2007;3:e160. - PMC - PubMed
    1. Cao T, Li Q, Huang Y et al. plotnineSeqSuite: a Python package for visualizing sequence data using ggplot2 style. BMC Genomics 2023;24:585. - PMC - PubMed