Nucleic Acids Res. 2019 Jan 8;47(D1):D666-D677.
doi: 10.1093/nar/gky901.

IMG/M v.5.0: an integrated data management and comparative analysis system for microbial genomes and microbiomes

I-Min A Chen et al. Nucleic Acids Res. 2019.

Abstract

The Integrated Microbial Genomes & Microbiomes system v.5.0 (IMG/M: https://img.jgi.doe.gov/m/) contains annotated datasets categorized into: archaea, bacteria, eukarya, plasmids, viruses, genome fragments, metagenomes, cell enrichments, single particle sorts, and metatranscriptomes. Source datasets include those generated by the DOE's Joint Genome Institute (JGI), submitted by external scientists, or collected from public sequence data archives such as NCBI. All submissions are typically processed through the IMG annotation pipeline and then loaded into the IMG data warehouse. IMG's web user interface provides a variety of analytical and visualization tools for comparative analysis of isolate genomes and metagenomes in IMG. IMG/M allows open access to all public genomes in the IMG data warehouse, while its expert review (ER) system (IMG/MER: https://img.jgi.doe.gov/mer/) allows registered users to access their private genomes and to store their private datasets in workspace for sharing and for further analysis. IMG/M data content has grown by 60% since the last report published in the 2017 NAR Database Issue. IMG/M v.5.0 has a new and more powerful genome search feature, new statistical tools, and supports metagenome binning.

Figures

Figure 1.
IMG & GOLD Citation Overview. (i) The IMG & GOLD Citation Overview page shows all papers citing IMG or GOLD, divided into categories. (ii) Users can click on a category to see the papers in that category. The list can be searched or sorted. The example shows a case-insensitive search for ‘nucleic acids research’ using the Search field in the upper-right corner. (iii) Clicking on a title leads to the publication itself.
Figure 2.
Quick Search option for the new Genome Search feature. (i) The Quick Search option allows users to type in a keyword to search all IMG genomes. (ii) The search results can be added to the Genome Cart.
Figure 3.
Example application of the Advanced Search Builder option of the new Genome Search feature. (i) Users can build a complex query, here one that finds all soil metagenome datasets sampled at a depth of up to 10 cm in Wisconsin or Michigan that are not classified as agricultural soils. (ii) Users can click the Evaluate Query button to see query statistics. (iii) The query result is displayed after the Search button is clicked.
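The Boolean structure of the example query in Figure 3 can be sketched in a few lines of Python. This is only an illustration of the logic (soil AND depth of up to 10 cm AND (Wisconsin OR Michigan) AND NOT agricultural). The `Dataset` record, its field names, and the sample entries are hypothetical and do not reflect IMG's actual metadata schema or any IMG API.

```python
from dataclasses import dataclass


@dataclass
class Dataset:
    # Hypothetical metadata record; field names are assumptions, not IMG's schema.
    name: str
    ecosystem_type: str      # e.g. "Soil"
    ecosystem_subtype: str   # e.g. "Agricultural", "Forest"
    state: str
    depth_cm: float


def matches_example_query(d: Dataset) -> bool:
    """Soil AND depth <= 10 cm AND (Wisconsin OR Michigan) AND NOT agricultural."""
    return (
        d.ecosystem_type == "Soil"
        and d.depth_cm <= 10
        and d.state in {"Wisconsin", "Michigan"}
        and d.ecosystem_subtype != "Agricultural"
    )


# Made-up sample records, for illustration only.
datasets = [
    Dataset("forest_topsoil", "Soil", "Forest", "Wisconsin", 5.0),
    Dataset("corn_field", "Soil", "Agricultural", "Michigan", 8.0),
    Dataset("deep_core", "Soil", "Forest", "Michigan", 30.0),
    Dataset("prairie_soil", "Soil", "Grassland", "Iowa", 4.0),
    Dataset("lake_water", "Freshwater", "Lake", "Wisconsin", 2.0),
]

selected = [d.name for d in datasets if matches_example_query(d)]
print(selected)
```

Each condition in the predicate corresponds to one clause of the query described in the caption, with the two states grouped as a single OR clause.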
Figure 4.
Search History of the new Genome Search feature. (i) All searches done in a session are saved. Users can also reconstruct and rerun any of the selected queries. (ii) Expert Review users have the additional ability to save queries to the Workspace.
Figure 5.
Decision tree for selecting the default statistical test method. FDR: false discovery rate.
Figure 6.
The new analysis tools are available in the Statistical Analysis tab of Workspace Genome Sets. In this example, the user selects two genome sets to compare gene counts by Pfam using the system's default recommendation. Users can click on the question mark to view a detailed description of the analysis methods. The user clicks the Run Analysis button to submit the analysis request, and the UI reports which default analysis method was chosen. The analysis runs in the background and the result is saved as a new job.
Figure 7.
Analysis statuses and results are available from My Jobs in the Workspace. A job starts in the waiting status. Users can view the analysis result once a job is complete. The result data table can be exported, and users can also choose to download a complete report.
Figure 8.
New metagenome bins in IMG. (i) Metagenome Statistics on the metagenome detail page shows the number of bins. (ii) Users can view more detailed information on the bins by clicking on the count. Expert Review users can also select one or more bins to save as Workspace scaffold sets. (iii) After the user clicks the scaffold count, a new Metagenome Bin Scaffolds page lists all scaffolds in the bin, with more detailed information on each scaffold.
