Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Aug 2;2(4):e00237-17.
doi: 10.1128/mSphere.00237-17. eCollection 2017 Jul-Aug.

Hiding in Plain Sight: Mining Bacterial Species Records for Phenotypic Trait Information

Affiliations

Hiding in Plain Sight: Mining Bacterial Species Records for Phenotypic Trait Information

Albert Barberán et al. mSphere. .

Abstract

Cultivation in the laboratory is essential for understanding the phenotypic characteristics and environmental preferences of bacteria. However, basic phenotypic information is not readily accessible. Here, we compiled phenotypic and environmental tolerance information for >5,000 bacterial strains described in the International Journal of Systematic and Evolutionary Microbiology (IJSEM) with all information made publicly available in an updatable database. Although the data span 23 different bacterial phyla, most entries described aerobic, mesophilic, neutrophilic strains from Proteobacteria (mainly Alpha- and Gammaproteobacteria), Actinobacteria, Firmicutes, and Bacteroidetes isolated from soils, marine habitats, and plants. Most of the routinely measured traits tended to show a significant phylogenetic signal, although this signal was weak for environmental preferences. We demonstrated how this database could be used to link genomic attributes to differences in pH and salinity optima. We found that adaptations to high salinity or high-pH conditions are related to cell surface transporter genes, along with previously uncharacterized genes that might play a role in regulating environmental tolerances. Together, this work highlights the utility of this database for associating bacterial taxonomy, phylogeny, or specific genes to measured phenotypic traits and emphasizes the need for more comprehensive and consistent measurements of traits across a broader diversity of bacteria. IMPORTANCE Cultivation in the laboratory is key for understanding the phenotypic characteristics, growth requirements, metabolism, and environmental preferences of bacteria. However, oftentimes, phenotypic information is not easily accessible. Here, we compiled phenotypic and environmental tolerance information for >5,000 bacterial strains described in the International Journal of Systematic and Evolutionary Microbiology (IJSEM). We demonstrate how this database can be used to link bacterial taxonomy, phylogeny, or specific genes to measured phenotypic traits and environmental preferences. The phenotypic database can be freely accessed (https://doi.org/10.6084/m9.figshare.4272392), and we have included instructions for researchers interested in adding new entries or curating existing ones.

Keywords: pH; phenotypes; phylogeny; salinity; traits.

PubMed Disclaimer

Figures

FIG 1
FIG 1
Taxonomic distribution (A) and habitat distribution (B) of the >4,000 bacterial strains present in the phenotype database. The inset in panel A shows the strain representation of the major proteobacterial subgroups in the database. Note that in panel B the habitat is the environment from which each strain was originally isolated (if reported) and may not accurately reflect where those strains may be most abundant.
FIG 2
FIG 2
Distribution of selected traits across the >4,000 strains in the most recent version of the database, including cell shape (A), spore formation (B), motility (C), oxygen requirements (D), temperature optimum (E), and pH optimum (F). The number of strains with information for a particular trait is indicated in parentheses.
FIG 3
FIG 3
Phylogenetic signal of selected traits: presence of pigment (A), spore formation (B), pH optima (C), and temperature optima (D). For categorical variables (A and B), the red columns indicate presence. For continuous variables (C and D), the red columns indicate the reported value.

References

    1. Giovannoni S, Stingl U. 2007. The importance of culturing bacterioplankton in the ‘omics’ age. Nat Rev Microbiol 5:820–826. doi: 10.1038/nrmicro1752. - DOI - PubMed
    1. Temperton B, Giovannoni SJ. 2012. Metagenomics: microbial diversity through a scratched lens. Curr Opin Microbiol 15:605–612. doi: 10.1016/j.mib.2012.07.001. - DOI - PubMed
    1. Kyrpides NC, Hugenholtz P, Eisen JA, Woyke T, Göker M, Parker CT, Amann R, Beck BJ, Chain PSG, Chun J, Colwell RR, Danchin A, Dawyndt P, Dedeurwaerdere T, DeLong EF, Detter JC, De Vos P, Donohue TJ, Dong XZ, Ehrlich DS, Fraser C, Gibbs R, Gilbert J, Gilna P, Glöckner FO, Jansson JK, Keasling JD, Knight R, Labeda D, Lapidus A, Lee JS, Li WJ, Ma J, Markowitz V, Moore ERB, Morrison M, Meyer F, Nelson KE, Ohkuma M, Ouzounis CA, Pace N, Parkhill J, Qin N, Rossello-Mora R, Sikorski J, Smith D, Sogin M, Stevens R, Stingl U, Suzuki K-I, Taylor D, Tiedje JM, Tindall B, Wagner M, Weinstock G, Weissenbach J, White O, Wang J, Zhang L, Zhou Y-G, Field D, Whitman WB, Garrity GM, Klenk H-P. 2014. Genomic encyclopedia of bacteria and archaea: sequencing a myriad of type strains. PLoS Biol 12:e1001920. doi: 10.1371/journal.pbio.1001920. - DOI - PMC - PubMed
    1. Konstantinidis KT, Ramette A, Tiedje JM. 2006. The bacterial species definition in the genomic era. Philos Trans R Soc Lond B Biol Sci 361:1929–1940. doi: 10.1098/rstb.2006.1920. - DOI - PMC - PubMed
    1. Rinke C, Schwientek P, Sczyrba A, Ivanova NN, Anderson IJ, Cheng JF, Darling A, Malfatti S, Swan BK, Gies EA, Dodsworth JA, Hedlund BP, Tsiamis G, Sievert SM, Liu WT, Eisen JA, Hallam SJ, Kyrpides NC, Stepanauskas R, Rubin EM, Hugenholtz P, Woyke T. 2013. Insights into the phylogeny and coding potential of microbial dark matter. Nature 499:431–437. doi: 10.1038/nature12352. - DOI - PubMed