Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Dec 28:12:giad022.
doi: 10.1093/gigascience/giad022. Epub 2023 Apr 10.

The GEN-ERA toolbox: unified and reproducible workflows for research in microbial genomics

Affiliations

The GEN-ERA toolbox: unified and reproducible workflows for research in microbial genomics

Luc Cornet et al. Gigascience. .

Abstract

Background: Microbial culture collections play a key role in taxonomy by studying the diversity of their strains and providing well-characterized biological material to the scientific community for fundamental and applied research. These microbial resource centers thus need to implement new standards in species delineation, including whole-genome sequencing and phylogenomics. In this context, the genomic needs of the Belgian Coordinated Collections of Microorganisms were studied, resulting in the GEN-ERA toolbox. The latter is a unified cluster of bioinformatic workflows dedicated to both bacteria and small eukaryotes (e.g., yeasts).

Findings: This public toolbox allows researchers without a specific training in bioinformatics to perform robust phylogenomic analyses. Hence, it facilitates all steps from genome downloading and quality assessment, including genomic contamination estimation, to tree reconstruction. It also offers workflows for average nucleotide identity comparisons and metabolic modeling.

Technical details: Nextflow workflows are launched by a single command and are available on the GEN-ERA GitHub repository (https://github.com/Lcornet/GENERA). All the workflows are based on Singularity containers to increase reproducibility.

Testing: The toolbox was developed for a diversity of microorganisms, including bacteria and fungi. It was further tested on an empirical dataset of 18 (meta)genomes of early branching Cyanobacteria, providing the most up-to-date phylogenomic analysis of the Gloeobacterales order, the first group to diverge in the evolutionary tree of Cyanobacteria.

Conclusion: The GEN-ERA toolbox can be used to infer completely reproducible comparative genomic and metabolic analyses on prokaryotes and small eukaryotes. Although designed for routine bioinformatics of culture collections, it can also be used by all researchers interested in microbial taxonomy, as exemplified by our case study on Gloeobacterales.

Keywords: Gloeobacterales; Cyanobacteria; Singularity containers; culture collections; genomics; metagenomics; nextflow; phylogenomics; phylogeny; workflow.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1:
Figure 1:
Overview of the GEN-ERA toolbox.
Figure 2:
Figure 2:
Results of the Gloeobacterales analysis. (A) Phylogenomic analysis of the Gloeobacterales order, conducted on 198 core genes using DNA sequences. The tree was inferred with RAxML under the GTRGAMMA model on a supermatrix of 21 × 225,524 unambiguously aligned nucleotide positions. (B) SSU rRNA phylogeny constrained by a phylogenomic analysis of ribosomal proteins, computed with ORPER. (C) Metabolic modeling of Gloeobacterales and closely associated taxa. Detailed methods and results of the Gloeobacterales analysis are available in Supplemental File 1. Gloeobacterales are indicated in red.

Similar articles

Cited by

References

    1. Goris J, Konstantinidis KT, Klappenbach JA, et al. DNA–DNA hybridization values and their relationship to whole-genome sequence similarities. Int J Syst Evol Microbiol. 2007;57:81–91. - PubMed
    1. Richter M, Rosselló-Móra R. Shifting the genomic gold standard for the prokaryotic species definition. Proc Natl Acad Sci U S A. 2009;106:19126–31. - PMC - PubMed
    1. Tindall BJ, Rosselló-Móra R, Busse H-J, et al. Notes on the characterization of prokaryote strains for taxonomic purposes. Int J Syst Evol Microbiol. 2010;60:249–66. - PubMed
    1. Lachance M-A, Lee DK, Hsiang T. Delineating yeast species with genome average nucleotide identity: a calibration of ANI with haplontic, heterothallic metschnikowia species. Antonie Van Leeuwenhoek. 2020;113:2097–106. - PubMed
    1. Parks DH, Chuvochina M, Chaumeil P-A, et al. Selection of representative genomes for 24,706 bacterial and archaeal species clusters provide a complete genome-based taxonomy. Biorxiv. 2019. bioRxiv.10.1101/771964 - DOI

Publication types