Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2017 Mar 6:8:23.
doi: 10.3389/fgene.2017.00023. eCollection 2017.

A Review of Bioinformatics Tools for Bio-Prospecting from Metagenomic Sequence Data

Affiliations
Review

A Review of Bioinformatics Tools for Bio-Prospecting from Metagenomic Sequence Data

Despoina D Roumpeka et al. Front Genet. .

Abstract

The microbiome can be defined as the community of microorganisms that live in a particular environment. Metagenomics is the practice of sequencing DNA from the genomes of all organisms present in a particular sample, and has become a common method for the study of microbiome population structure and function. Increasingly, researchers are finding novel genes encoded within metagenomes, many of which may be of interest to the biotechnology and pharmaceutical industries. However, such "bioprospecting" requires a suite of sophisticated bioinformatics tools to make sense of the data. This review summarizes the most commonly used bioinformatics tools for the assembly and annotation of metagenomic sequence data with the aim of discovering novel genes.

Keywords: assembly; bioinformatics; bioprospecting; gene prediction; metagenomics; next generation sequencing.

PubMed Disclaimer

Figures

FIGURE 1
FIGURE 1
A typical bioinformatics pipeline. The genomic material (taken directly from the environmental sample) is sequenced and processed using assembly, gene prediction and gene annotation tools. Finally, the findings are shared between the scientific groups around the world.

References

    1. Alneberg J., Bjarnason B. S., de Bruijn I., Schirmer M., Quick J., Ijaz U. Z., et al. (2014). Binning metagenomic contigs by coverage and composition. Nat. Methods 11 1144–1146. 10.1038/nmeth.3103 - DOI - PubMed
    1. Aziz R. K., Bartels D., Best A. A., DeJongh M., Disz T., Edwards R. A., et al. (2008). The RAST server: rapid annotations using subsystems technology. BMC Genomics 8:75 10.1186/1471-2164-9-75 - DOI - PMC - PubMed
    1. Bloom B. H. (1970). Space/time trade-offs in hash coding with allowable errors. Commun. ACM 13 422–426. 10.1145/362686.362692 - DOI
    1. Boisvert S., Raymond F., Godzaridis E., Laviolette F., Corbeil J. (2012). Ray Meta: scalable de novo metagenome assembly and profiling. Genome Biol. 13:R122 10.1186/gb-2012-13-12-r122 - DOI - PMC - PubMed
    1. Bowe A., Onodera T., Sadakane K., Shibuya T. (2012). “Succinct de bruijn graphs,” in Proceedings of the 12th International Workshop, WABI 2012 Ljubljana, Slovenia, September 10-12 2012: Algorithms in Bioinformatics, eds Raphael B., Tang J. (Berlin: Springer; ), 225–235. 10.1007/978-3-642-33122-0_18 - DOI

LinkOut - more resources