Metagenomic signatures of 86 microbial and viral metagenomes
- PMID: 19302541
- DOI: 10.1111/j.1462-2920.2009.01901.x
Abstract
Previous studies have shown that dinucleotide abundances capture the majority of variation in genome signatures and are useful for quantifying lateral gene transfer and building molecular phylogenies. Metagenomes contain a mixture of individual genomes, and might be expected to lack compositional signatures. In many metagenomic data sets the majority of sequences have no significant similarity to known sequences and are effectively excluded from subsequent analyses. To circumvent this limitation, di-, tri- and tetranucleotide abundances of 86 microbial and viral metagenomes consisting of short pyrosequencing reads were analysed. This provides a method that includes all sequences and can be combined with other analyses to increase our knowledge about microbial and viral communities. Both principal component analysis and hierarchical clustering showed definitive groupings of metagenomes drawn from similar environments. Together these analyses showed that dinucleotide composition, as opposed to tri- and tetranucleotide composition, defines a metagenomic signature that can explain up to 80% of the variance between biomes, which is comparable to that obtained by functional genomics. Metagenomes with anomalous content were also identified using dinucleotide abundances. Subsequent analyses determined that these metagenomes were contaminated with exogenous DNA, suggesting that this approach is a useful metric for quality control. The predictive strength of the dinucleotide composition also opens the possibility of assigning ecological classifications to unknown fragments. Environmental selection may be responsible for this dinucleotide signature through direct selection of specific compositional signals; however, simulations suggest that the environment may select indirectly by promoting the increased abundance of a few dominant taxa.
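As an illustration of the kind of signature the abstract describes, the sketch below pools short reads into a 16-element dinucleotide frequency profile and compares two profiles by Euclidean distance. It is not the authors' pipeline. The function names (`dinucleotide_profile`, `euclidean`), the use of overlapping windows, the skipping of ambiguous bases, the single-strand counting, and the toy reads are all assumptions made for illustration.

```python
from collections import Counter
from itertools import product
from math import sqrt

# The 16 dinucleotides in a fixed order: AA, AC, AG, AT, CA, ...
DINUCLEOTIDES = ["".join(p) for p in product("ACGT", repeat=2)]
VALID = set(DINUCLEOTIDES)


def dinucleotide_profile(reads):
    """Relative frequency of each dinucleotide, pooled over all reads.

    Assumptions: overlapping windows of length 2, forward strand only,
    and any window containing a non-ACGT character (e.g. N) is skipped.
    """
    counts = Counter()
    for read in reads:
        seq = read.upper()
        for i in range(len(seq) - 1):
            pair = seq[i:i + 2]
            if pair in VALID:
                counts[pair] += 1
    total = sum(counts.values())
    if total == 0:
        return [0.0] * len(DINUCLEOTIDES)
    return [counts[d] / total for d in DINUCLEOTIDES]


def euclidean(p, q):
    """Euclidean distance between two equal-length profiles."""
    return sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


# Toy reads standing in for two metagenomes (hypothetical data).
metagenome_a = ["ACGTACGT", "GGCCGGCC"]
metagenome_b = ["ATATATAT", "TTAATTAA"]

profile_a = dinucleotide_profile(metagenome_a)
profile_b = dinucleotide_profile(metagenome_b)
distance_ab = euclidean(profile_a, profile_b)
```

Profiles computed this way for many metagenomes could be stacked into a matrix and passed to a principal component analysis or hierarchical clustering routine, which is the type of analysis the abstract reports; the normalization and any strand handling used in the study itself are not stated here.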