Review
Comput Struct Biotechnol J. 2016 Dec 5;15:48-55.
doi: 10.1016/j.csbj.2016.11.005. eCollection 2017.

Bioinformatics strategies for taxonomy independent binning and visualization of sequences in shotgun metagenomics

Karel Sedlar et al. Comput Struct Biotechnol J.

Abstract

One of the main steps in a study of microbial communities is resolving their composition, diversity and function. In the past, these issues were mostly addressed by amplicon sequencing of a target gene because of its reasonable price and the easier computational postprocessing of the bioinformatic data. With the advancement of sequencing techniques, the main focus shifted to whole metagenome shotgun sequencing, which allows much more detailed analysis of the metagenomic data, including reconstructing novel microbial genomes and gaining knowledge about the genetic potential and metabolic capacities of whole environments. On the other hand, the output of whole metagenome shotgun sequencing is a mixture of short DNA fragments belonging to various genomes; this approach therefore requires more sophisticated computational algorithms for clustering related sequences, commonly referred to as sequence binning. There are currently two types of binning methods: taxonomy dependent and taxonomy independent. The first type classifies the DNA fragments by performing a standard homology inference against a reference database, while the latter performs reference-free binning by applying clustering techniques to features extracted from the sequences. In this review, we describe the strategies within the second approach. Although these strategies do not require prior knowledge, they place higher demands on the length of sequences. Besides their basic principle, an overview of particular methods and tools is provided. Furthermore, the review covers the utilization of the methods in the context of sequence length and discusses the need for metagenomic data preprocessing in the form of an initial assembly prior to binning.

Keywords: Abundance; Genomic signature; Metagenomics; Sequence binning; Taxonomy independent; Visualization.
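
The reference-free idea described in the abstract (extract features from each sequence, then cluster) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the review: the feature is a normalized k-mer frequency vector used as a simple genomic signature, the clustering is a greedy "leader" scheme with a fixed Euclidean distance threshold, and the names `kmer_signature` and `bin_sequences` are hypothetical. The tools covered by the review use more elaborate features and clustering procedures.

```python
from itertools import product
import math


def kmer_signature(seq, k=4):
    """Normalized k-mer frequency vector: a simple genomic signature (illustrative)."""
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    index = {kmer: i for i, kmer in enumerate(kmers)}
    counts = [0] * len(kmers)
    seq = seq.upper()
    for i in range(len(seq) - k + 1):
        j = index.get(seq[i:i + k])  # windows containing N or other symbols are skipped
        if j is not None:
            counts[j] += 1
    total = sum(counts)
    return [c / total for c in counts] if total else [0.0] * len(kmers)


def euclidean(a, b):
    """Euclidean distance between two equally long vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def bin_sequences(seqs, k=4, threshold=0.3):
    """Greedy leader clustering (an assumed stand-in for a real binning method).

    Each sequence joins the bin whose first member (the leader) has the
    nearest signature, provided that distance is below `threshold`;
    otherwise it starts a new bin. Returns bins as lists of input indices.
    """
    leaders = []  # signature of the first sequence placed in each bin
    bins = []     # indices of the sequences assigned to each bin
    for idx, seq in enumerate(seqs):
        sig = kmer_signature(seq, k)
        best, best_dist = None, threshold
        for b, leader in enumerate(leaders):
            d = euclidean(sig, leader)
            if d < best_dist:
                best, best_dist = b, d
        if best is None:
            leaders.append(sig)
            bins.append([idx])
        else:
            bins[best].append(idx)
    return bins


if __name__ == "__main__":
    # Toy input: two AT-rich and two GC-rich fragments (hypothetical data).
    toy = [
        "ATATATATTATAATATATTA",
        "TATATAATATTATATAATAT",
        "GCGCGGCGCCGCGCGGCCGC",
        "CGCGCCGGCGCGGCGCGCCG",
    ]
    print(bin_sequences(toy, k=2, threshold=0.3))
```

The toy fragments are only 20 bases long, so the example uses dinucleotides (`k=2`). As the abstract notes, signature-based strategies place higher demands on sequence length, so real data would call for much longer sequences before a tetranucleotide (`k=4`) signature becomes stable.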

Figures

Fig. 1. Schematic distribution of current taxonomy independent binning methods into three categories; the eye symbol highlights the methods that enable visualization of datasets.
Fig. 2. Workflow of taxonomy independent binning strategies.
