Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Nov 7;114(45):E9549-E9558.
doi: 10.1073/pnas.1706849114. Epub 2017 Oct 24.

Evolutionary and molecular foundations of multiple contemporary functions of the nitroreductase superfamily

Affiliations

Evolutionary and molecular foundations of multiple contemporary functions of the nitroreductase superfamily

Eyal Akiva et al. Proc Natl Acad Sci U S A. .

Abstract

Insight regarding how diverse enzymatic functions and reactions have evolved from ancestral scaffolds is fundamental to understanding chemical and evolutionary biology, and for the exploitation of enzymes for biotechnology. We undertook an extensive computational analysis using a unique and comprehensive combination of tools that include large-scale phylogenetic reconstruction to determine the sequence, structural, and functional relationships of the functionally diverse flavin mononucleotide-dependent nitroreductase (NTR) superfamily (>24,000 sequences from all domains of life, 54 structures, and >10 enzymatic functions). Our results suggest an evolutionary model in which contemporary subgroups of the superfamily have diverged in a radial manner from a minimal flavin-binding scaffold. We identified the structural design principle for this divergence: Insertions at key positions in the minimal scaffold that, combined with the fixation of key residues, have led to functional specialization. These results will aid future efforts to delineate the emergence of functional diversity in enzyme superfamilies, provide clues for functional inference for superfamily members of unknown function, and facilitate rational redesign of the NTR scaffold.

Keywords: enzyme superfamilies; evolution; flavoenzyme; nitroreductase; sequence similarity network.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Fig. 1.
Fig. 1.
An overview of NTR superfamily structure and reaction diversity. (A) A representative NTR structure (PDB ID code 3E39) is depicted in cartoon display in two orientations, with individual monomers colored in gray and red and FMN depicted as a stick model with carbons in yellow (Dataset S1 includes a detailed list of NTR superfamily structures). (B) Diagram showing the ping-pong bi-bi reaction scheme. (C) Representative NTR superfamily reactions: Electron donor (oxidative) reactions, e.g., (1) nicotinamide oxidation, (2) thiazoline oxidation, (3) diketopiperazine oxidation; FMN reduction from (4) oxidized FMN, (5) FMN semiquinone to (6) reduced FMN; electron acceptor (reductive) reactions, e.g., (7) deiodination, (8) quinone reduction, (9) nitroimidazole reduction, (10) ene reduction, and (11) the fragmentation of reduced FMN to dimethylbenzamide.
Fig. 2.
Fig. 2.
A representative SSN of the NTR superfamily: 24,270 protein sequences are depicted by 5,337 nodes (circles), which represent proteins sharing >60% sequence identity. Edges between nodes indicate an average pairwise BLAST E-value of at least 1 × 10−18. Node coloring represents subgroup classification. White nodes with light gray borders indicate remainder sequences that do not belong in any of the categorized subgroups. Large triangle nodes include at least one solved crystal structure; black borders indicate that a biochemical activity was also experimentally characterized. Large circular nodes with black borders include at least one protein associated with experimental evidence (but without structural information). Names in bold indicate subgroups that contain at least one protein with literature-documented functional information. The network is visualized by Cytoscape (74) using the organic layout algorithm (36). (Inset) HMM networks of the NTR superfamily. Nodes represent SSGs (Materials and Methods), and node size correlates with SSG size, from smallest (<100 proteins) to largest (>300 proteins). Edges represent pairwise HMM alignment between SSGs, and similarities with HHALIGN scores >154 (corresponding with an HMM alignment score more significant than 1 × 10−24) are shown. Edge color and width correspond with the HHalign score: <160 indicated by thin and light edges, >300 indicated by thick and dark edges. Nodes are colored based on (A) subgroup and (B) betweenness centrality.
Fig. 3.
Fig. 3.
The NTR superfamily scaffold. (A) The NTR superfamily domain: An overlay of 17 representative NTR structures showing the conserved α+β FMN binding fold that was generated using MUSTANG-MR (76) at a sieving level of 2.0 Å. (B) A 2D topology map of the minimal NTR scaffold colored from blue (N terminus) to red (C terminus) with numbered α-helices and β-strands. (C) A ribbon representation of the hub subgroup structure PDB ID code 3E39 with monomers colored in gray and red, respectively. FMN is depicted in stick form with carbons in yellow. (Inset) Key FMN interacting residues: The FMN moiety and interacting active site residues are displayed in stick form and labeled.
Fig. 4.
Fig. 4.
Structural analysis of the NTR superfamily. (A) A structure similarity network of the NTR superfamily. Each node represents a crystal structure, colored by subgroup as per Fig. 2 (red nodes represent hub subgroup members). Nodes are filled according to the presence or absence of the structural extensions inserted in any of the three hot spot sites, as depicted by the key (Inset). Edges represent pairwise structural similarity scored <0.746, as measured by TM-align. (B) A diagram of the structural diversity observed at the E1, E2, and E3 insertion sites relative to one FMN binding active site of the enzyme. A cartoon representation of a hub protein structure (PDB ID code 3E39) is shown with monomers depicted in gray and red. The locations of the E1, E2, and E3 structural insertion points are indicated by spheres that depict the bordering residues of each insertion (E3 has only one bordering residue, as it extends the C terminus). The FMN molecule is shown in a stick model with carbons colored in yellow. (Inset) Boxes display examples of subgroup specific diversity at each extension site labeled by PDB ID code and subgroup. Extensions are colored by subgroup as per Fig. 2.
Fig. 5.
Fig. 5.
Conservation of FMN-interacting positions across the NTR superfamily. (A) A representative SSN of the NTR superfamily is shown with nodes colored by the most frequent residue type found in the FMN phosphate moiety interacting position. (Inset) Ribbon representation of the active site of hub subgroup structure PDB ID code 3E39 is shown with FMN depicted in stick form with carbons in yellow. The residue (arginine) interacting with the phosphate moiety is circled. (B) A representative SSN of the NTR superfamily is shown with nodes colored by the most frequent residue type found in the re-loop position. (Inset) Ribbon representation of PDB ID code 3E39 (as per A) with the re-loop residue (leucine) circled. Note that the re-loop residue originates from the alternative chain of the homodimer (shown in red).
Fig. 6.
Fig. 6.
A phylogenetic reconstruction of the NTR superfamily. Branches are colored and labeled by subgroup; dispersed red branches represent hub subgroup sequence sets, and black branches represent members of the remainder subgroup. The eight hub SSG-5 branches are labeled (H5). Circles represent branching points with probabilities >0.9; triangles represent probabilities >0.8.

Similar articles

Cited by

References

    1. Gerlt JA, Babbitt PC. Divergent evolution of enzymatic function: Mechanistically diverse superfamilies and functionally distinct suprafamilies. Annu Rev Biochem. 2001;70:209–246. - PubMed
    1. Almonacid DE, Babbitt PC. Toward mechanistic classification of enzyme functions. Curr Opin Chem Biol. 2011;15:435–442. - PMC - PubMed
    1. Brown SD, Babbitt PC. New insights about enzyme evolution from large scale studies of sequence and structure relationships. J Biol Chem. 2014;289:30221–30228. - PMC - PubMed
    1. Sillitoe I, et al. CATH: Comprehensive structural and functional annotations for genome sequences. Nucleic Acids Res. 2015;43:D376–D381. - PMC - PubMed
    1. Horowitz NH. On the evolution of biochemical syntheses. Proc Natl Acad Sci USA. 1945;31:153–157. - PMC - PubMed

Publication types