Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Jan 8;47(D1):D382-D389.
doi: 10.1093/nar/gky1054.

MBGD update 2018: microbial genome database based on hierarchical orthology relations covering closely related and distantly related comparisons

Affiliations

MBGD update 2018: microbial genome database based on hierarchical orthology relations covering closely related and distantly related comparisons

Ikuo Uchiyama et al. Nucleic Acids Res. .

Abstract

The Microbial Genome Database for Comparative Analysis (MBGD) is a database for comparative genomics based on comprehensive orthology analysis of bacteria, archaea and unicellular eukaryotes. MBGD now contains 6318 genomes. To utilize the database for both closely related and distantly related genomes, MBGD previously provided two types of ortholog tables: the standard ortholog table containing one representative genome from each genus covering the entire taxonomic range and the taxon specific ortholog tables for each taxon. However, this approach has a drawback in that the standard ortholog table contains only genes that are conserved in the representative genomes. To address this problem, we developed a stepwise procedure to construct ortholog tables hierarchically in a bottom-up manner. By using this approach, the new standard ortholog table now covers the entire gene repertoire stored in MBGD. In addition, we have enhanced several functionalities, including rapid and flexible keyword searching, profile-based sequence searching for orthology assignment to a user query sequence, and displaying a phylogenetic tree of each taxon based on the concatenated core gene sequences. For integrative database searching, the core data in MBGD are represented in Resource Description Framework (RDF) and a SPARQL interface is provided to search them. MBGD is available at http://mbgd.genome.ad.jp/.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
The bottom-up procedure for constructing hierarchical orthology relationships. (A) Overview of the procedure. The procedure progresses from bottom to top. (B) Hierarchical ortholog groups. Here, the construction process goes from right to left and the expansion process goes from left to right. A representative gene in each cluster is indicated in red, and the target clusters to be expanded are underlined. A gene in a pan-genome is represented as ‘taxid:clustid’, which is actually the representative gene of the cluster. The number in parentheses is the domain number and the two numbers after each gene name are the beginning and end positions of the domain. (C) Domain boundary mapping between clusters at different levels. The example is the same as in B. The red segment corresponds to the domain tax44249:7443(2) in the standard cluster 98932. Missing positions by this mapping are filled by a simple linear interpolation, shown by the numbers in parentheses.
Figure 2.
Figure 2.
Overall procedure for constructing MBGD. This figure is an update of the previous version (3).
Figure 3.
Figure 3.
Screenshots of the new functionalities in MBGD. (A) An example of a hierarchical ortholog group. Shown is the ortholog group containing Shiga-like toxin subunit A. (B) A phylogenetic tree shown in the ortholog table summary viewer. Shown is a part of the phylogenetic tree created from the conserved orthologs of the family Bacillaceae. (C) The output of the profile search using MMseqs2.
Figure 4.
Figure 4.
Interfaces for searching and browsing MBGD. Interfaces are shown in the light pink boxes.

Similar articles

Cited by

References

    1. Tettelin H., Masignani V., Cieslewicz M.J., Donati C., Medini D., Ward N.L., Angiuoli S.V., Crabtree J., Jones A.L., Durkin A.S. et al. . Genome analysis of multiple pathogenic isolates of Streptococcus agalactiae: implications for the microbial “pan-genome”. Proc. Natl. Acad. Sci. U.S.A. 2005; 102:13950–13955. - PMC - PubMed
    1. Uchiyama I. MBGD: microbial genome database for comparative analysis. Nucleic Acids Res. 2003; 31:58–62. - PMC - PubMed
    1. Uchiyama I., Mihara M., Nishide H., Chiba H.. MBGD update 2015: microbial genome database for flexible ortholog analysis utilizing a diverse set of genomic data. Nucleic Acids Res. 2015; 43:D270–D276. - PMC - PubMed
    1. Uchiyama I. MBGD: a platform for microbial comparative genomics based on the automated construction of orthologous groups. Nucleic Acids Res. 2007; 35:D343–D346. - PMC - PubMed
    1. Uchiyama I., Mihara M., Nishide H., Chiba H.. MBGD update 2013: the microbial genome database for exploring the diversity of microbial world. Nucleic Acids Res. 2013; 41:D631–D635. - PMC - PubMed

Publication types