Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Jan 29;6(1):e1000661.
doi: 10.1371/journal.pcbi.1000661.

Automatic assignment of EC numbers

Affiliations

Automatic assignment of EC numbers

Volker Egelhofer et al. PLoS Comput Biol. .

Abstract

A wide range of research areas in molecular biology and medical biochemistry require a reliable enzyme classification system, e.g., drug design, metabolic network reconstruction and system biology. When research scientists in the above mentioned areas wish to unambiguously refer to an enzyme and its function, the EC number introduced by the Nomenclature Committee of the International Union of Biochemistry and Molecular Biology (IUBMB) is used. However, each and every one of these applications is critically dependent upon the consistency and reliability of the underlying data for success. We have developed tools for the validation of the EC number classification scheme. In this paper, we present validated data of 3788 enzymatic reactions including 229 sub-subclasses of the EC classification system. Over 80% agreement was found between our assignment and the EC classification. For 61 (i.e., only 2.5%) reactions we found that their assignment was inconsistent with the rules of the nomenclature committee; they have to be transferred to other sub-subclasses. We demonstrate that our validation results can be used to initiate corrections and improvements to the EC number classification scheme.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. Processes for reconstruction of a metabolic network.
Figure 2
Figure 2. Examples for the different subsets.
(a) The reverse direction of the reaction is shown. (b) Ambiguous, fits more than one sub-subclass. (c) Reaction is assigned to a wrong sub-subclass. (d) The enzyme catalysis two or more different types of reaction, where at least one does not meet the requirements of the assigned sub-subclass.
Figure 3
Figure 3. Examples for the different subsets.
(a) Unclear assignment (b) Ambiguous, fits two or more quite similar sub-subclasses. (c) Does not fit any defined sub-subclass. (d) Different sub-subclasses assigned, based on the identical reaction.
Figure 4
Figure 4. The complete procedure is demonstrated for the reaction catalysed by the enzyme indolelactate dehydrogenase (EC 1.1.1.110).
(a) In the first step the known reaction pairs NAD+/NADH and accordingly NAD+/H+ are identfied and removed from further calculation steps. (b) The functional groups within the remaining molecules are identified (c), counted and eliminated in the case if they are equal in number. (d) For each remaining group a distinct key is assigned. (e) Finally, a difference key of the overall reaction is generated.

References

    1. Luscombe NM, Babu MM, Yu H, Snyder M, Teichmann SA, et al. Genomic analysis of regulatory network dynamics reveals large topological changes. Nature. 2004;431:308–312. - PubMed
    1. Edwards JS, Palsson BO. The Escherichia coli MG1655 in silico metabolic genotype: Its definition, characteristics,and capabilities. Proc Natl Acad Sci U S A. 2000;97:5528–5533. - PMC - PubMed
    1. Maglott D, Ostell J, Pruitt KD, Tatusova T. Entrez Gene: gene-centered information at NCBI. Nucleic Acids Res. 2007;35:D26–D31. - PMC - PubMed
    1. Hubbard TJ, Aken BL, Beal K, Ballester B, Caccamo M, et al. Ensembl 2007. Nucleic Acids Res. 2007;35:D610–D617. - PMC - PubMed
    1. Goto S, Okuno Y, Hattori M, Nishioka T, Kanehisa M. LIGAND: database of chemical compounds and reactions in biological pathways. Nucleic Acids Res. 2002;30:402–404. - PMC - PubMed

Publication types