Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jan 6;51(D1):D587-D592.
doi: 10.1093/nar/gkac963.

KEGG for taxonomy-based analysis of pathways and genomes

Affiliations

KEGG for taxonomy-based analysis of pathways and genomes

Minoru Kanehisa et al. Nucleic Acids Res. .

Abstract

KEGG (https://www.kegg.jp) is a manually curated database resource integrating various biological objects categorized into systems, genomic, chemical and health information. Each object (database entry) is identified by the KEGG identifier (kid), which generally takes the form of a prefix followed by a five-digit number, and can be retrieved by appending /entry/kid in the URL. The KEGG pathway map viewer, the Brite hierarchy viewer and the newly released KEGG genome browser can be launched by appending /pathway/kid, /brite/kid and /genome/kid, respectively, in the URL. Together with an improved annotation procedure for KO (KEGG Orthology) assignment, an increasing number of eukaryotic genomes have been included in KEGG for better representation of organisms in the taxonomic tree. Multiple taxonomy files are generated for classification of KEGG organisms and viruses, and the Brite hierarchy viewer is used for taxonomy mapping, a variant of Brite mapping in the new KEGG Mapper suite. The taxonomy mapping enables analysis of, for example, how functional links of genes in the pathway and physical links of genes on the chromosome are conserved among organism groups.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
KEGG is an integrated database containing various biological objects (contents) stored in sixteen databases in four categories. Each object (database entry) is identified by the KEGG identifier (kid) as defined here. Except for GENES and a few other databases, the KEGG identifier takes the form of a prefix followed by a five-digit number. Manually created reference pathway maps, reference Brite hierarchies and reference modules are computationally expanded to organism-specific ones by converting KO identifiers (K numbers) to GENES identifiers in individual organisms (see text).
Figure 2.
Figure 2.
Simple URLs, called KEGG weblinks, to retrieve and analyze KEGG objects (database entries). The entry operation retrieves any object specified by the KEGG identifier in the flat-file format. The pathway, brite, module or network operation retrieves a molecular network object specified by the KEGG identifier with optional highlighting of network nodes given in the argument. The genome operation retrieves the genome map of a given organism with optional specification of a gene location. The pathway, brite and genome operations actually launch specialized tools enabling further analysis. In the URL form, www.kegg.jp may be replaced by www.genome.jp for accessing the GenomeNet mirror site.
Figure 3.
Figure 3.
The KEGG taxonomy browser, implemented as the Brite hierarchy viewer for taxonomy files, is now linked from the Organism menu of each KEGG pathway map. This example is linked from the lysine biosynthesis pathway map00300, included here by highlighting the plant-specific module M00527, and provides a taxonomy-based global view of organisms and organism groups in which the pathway and associated modules are present.
Figure 4.
Figure 4.
The KEGG genome browser is used to display syntenic regions in multiple genomes. Here, a region of eight genes represented as a sequence of eight K numbers is searched against all KEGG organisms. The result of matching organisms is shown in the taxonomy tree, from which selected genomes are displayed by aligning syntenic regions. Color coding is based on the functional category in KEGG.

References

    1. Kanehisa M., Furumichi M., Sato Y., Ishiguro-Watanabe M., Tanabe M.. KEGG: integrating viruses and cellular organisms. Nucleic Acids Res. 2021; 49:D545–D551. - PMC - PubMed
    1. Kanehisa M., Sato Y., Kawashima M.. KEGG mapping tools for uncovering hidden features in biological data. Protein Sci. 2022; 31:47–53. - PMC - PubMed
    1. Kanehisa M. Toward understanding the origin and evolution of cellular organisms. Protein Sci. 2019; 28:1947–1951. - PMC - PubMed
    1. Fleischmann R.D., Adams M.D., White O., Clayton R.A., Kirkness E.F., Kerlavage A.R., Bult C.J., Tomb J.F., Dougherty B.A., Merrick J.M.et al. .. Whole-genome random sequencing and assembly of haemophilus influenzae rd. Science. 1995; 269:496–512. - PubMed
    1. Sayers E.W., Beck J., Bolton E.E., Bourexis D., Brister J.R., Canese K., Comeau D.C., Funk K., Kim S., Klimke W.et al. .. Database resources of the national center for biotechnology information. Nucleic Acids Res. 2022; 50:D20–D26. - PMC - PubMed

Publication types