Nat Biotechnol. 2014 Aug;32(8):822-8.
doi: 10.1038/nbt.2939. Epub 2014 Jul 6.

Identification and assembly of genomes and genetic elements in complex metagenomic samples without using reference genomes

H Bjørn Nielsen et al. Nat Biotechnol. 2014 Aug.

Abstract

Most current approaches for analyzing metagenomic data rely on comparisons to reference genomes, but the microbial diversity of many environments extends far beyond what is covered by reference databases. De novo segregation of complex metagenomic data into specific biological entities, such as particular bacterial strains or viruses, remains a largely unsolved problem. Here we present a method, based on binning co-abundant genes across a series of metagenomic samples, that enables comprehensive discovery of new microbial organisms, viruses and co-inherited genetic entities and aids assembly of microbial genomes without the need for reference sequences. We demonstrate the method on data from 396 human gut microbiome samples and identify 7,381 co-abundance gene groups (CAGs), including 741 metagenomic species (MGS). We use these to assemble 238 high-quality microbial genomes and identify affiliations between MGS and hundreds of viruses or genetic entities. Our method provides the means for comprehensive profiling of the diversity within complex metagenomic samples.
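The idea of binning co-abundant genes across a series of samples can be illustrated with a minimal sketch. This is not the authors' implementation: the greedy seed-based grouping, the Pearson correlation measure, the 0.9 threshold, the function names and the toy abundance values are all assumptions chosen for illustration. Each gene is represented by its abundance profile across samples, and genes whose profiles rise and fall together are placed in the same group.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length abundance profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0  # a flat profile carries no co-abundance signal
    return cov / sqrt(vx * vy)


def bin_coabundant_genes(profiles, threshold=0.9):
    """Greedy binning (hypothetical scheme): each unassigned gene seeds a
    group and pulls in every other unassigned gene whose profile
    correlates with the seed at or above `threshold`."""
    unassigned = list(profiles)
    groups = []
    while unassigned:
        seed = unassigned.pop(0)
        group = [seed]
        remaining = []
        for gene in unassigned:
            if pearson(profiles[seed], profiles[gene]) >= threshold:
                group.append(gene)
            else:
                remaining.append(gene)
        unassigned = remaining
        groups.append(group)
    return groups


# Toy data (invented): abundance of five genes across four samples.
profiles = {
    "gene_a": [10, 20, 0, 5],
    "gene_b": [21, 39, 1, 11],
    "gene_c": [0, 1, 30, 2],
    "gene_d": [1, 0, 62, 3],
    "gene_e": [5, 5, 5, 40],
}
groups = bin_coabundant_genes(profiles)
print(groups)
```

In this toy input, gene_a and gene_b track each other across samples, as do gene_c and gene_d, so they fall into two groups and gene_e is left on its own. No reference sequence is consulted at any point, which is the property the abstract emphasizes.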
