Fast and accurate average genome size and 16S rRNA gene average copy number computation in metagenomic data
- PMID: 31488068
- PMCID: PMC6727555
- DOI: 10.1186/s12859-019-3031-y
Fast and accurate average genome size and 16S rRNA gene average copy number computation in metagenomic data
Abstract
Background: Metagenomics caused a quantum leap in microbial ecology. However, the inherent size and complexity of metagenomic data limit its interpretation. The quantification of metagenomic traits in metagenomic analysis workflows has the potential to improve the exploitation of metagenomic data. Metagenomic traits are organisms' characteristics linked to their performance. They are measured at the genomic level taking a random sample of individuals in a community. As such, these traits provide valuable information to uncover microorganisms' ecological patterns. The Average Genome Size (AGS) and the 16S rRNA gene Average Copy Number (ACN) are two highly informative metagenomic traits that reflect microorganisms' ecological strategies as well as the environmental conditions they inhabit.
Results: Here, we present the ags.sh and acn.sh tools, which analytically derive the AGS and ACN metagenomic traits. These tools represent an advance on previous approaches to compute the AGS and ACN traits. Benchmarking shows that ags.sh is up to 11 times faster than state-of-the-art tools dedicated to the estimation AGS. Both ags.sh and acn.sh show comparable or higher accuracy than existing tools used to estimate these traits. To exemplify the applicability of both tools, we analyzed the 139 prokaryotic metagenomes of TARA Oceans and revealed the ecological strategies associated with different water layers.
Conclusion: We took advantage of recent advances in gene annotation to develop the ags.sh and acn.sh tools to combine easy tool usage with fast and accurate performance. Our tools compute the AGS and ACN metagenomic traits on unassembled metagenomes and allow researchers to improve their metagenomic data analysis to gain deeper insights into microorganisms' ecology. The ags.sh and acn.sh tools are publicly available using Docker container technology at https://github.com/pereiramemo/AGS-and-ACN-tools .
Keywords: 16S rRNA gene average copy number; Average genome size; Functional traits; Metagenomics; Microbial ecology.
Conflict of interest statement
The authors declare that they have no competing interests.
Figures




Similar articles
-
Exploration of community traits as ecological markers in microbial metagenomes.Mol Ecol. 2012 Apr;21(8):1909-17. doi: 10.1111/j.1365-294X.2011.05383.x. Epub 2011 Nov 28. Mol Ecol. 2012. PMID: 22121910
-
PanFP: pangenome-based functional profiles for microbial communities.BMC Res Notes. 2015 Sep 26;8:479. doi: 10.1186/s13104-015-1462-8. BMC Res Notes. 2015. PMID: 26409790 Free PMC article.
-
COGNIZER: A Framework for Functional Annotation of Metagenomic Datasets.PLoS One. 2015 Nov 11;10(11):e0142102. doi: 10.1371/journal.pone.0142102. eCollection 2015. PLoS One. 2015. PMID: 26561344 Free PMC article.
-
Benchmarking Metagenomics Tools for Taxonomic Classification.Cell. 2019 Aug 8;178(4):779-794. doi: 10.1016/j.cell.2019.07.010. Cell. 2019. PMID: 31398336 Free PMC article. Review.
-
[A review on the bioinformatics pipelines for metagenomic research].Dongwuxue Yanjiu. 2012 Dec;33(6):574-85. doi: 10.3724/SP.J.1141.2012.06574. Dongwuxue Yanjiu. 2012. PMID: 23266976 Review. Chinese.
Cited by
-
Linking prokaryotic genome size variation to metabolic potential and environment.ISME Commun. 2023 Mar 27;3(1):25. doi: 10.1038/s43705-023-00231-x. ISME Commun. 2023. PMID: 36973336 Free PMC article.
-
Characterizing Wheat Rhizosphere Bacterial Microbiome Dynamics Under Salinity Stress: Insights from 16S rRNA Metagenomics for Enhancing Stress Tolerance.Plants (Basel). 2025 Mar 26;14(7):1033. doi: 10.3390/plants14071033. Plants (Basel). 2025. PMID: 40219101 Free PMC article.
-
Increasing pesticide diversity impairs soil microbial functions.Proc Natl Acad Sci U S A. 2025 Jan 14;122(2):e2419917122. doi: 10.1073/pnas.2419917122. Epub 2025 Jan 9. Proc Natl Acad Sci U S A. 2025. PMID: 39786931 Free PMC article.
-
Characterization of Environmental and Cultivable Antibiotic-Resistant Microbial Communities Associated with Wastewater Treatment.Antibiotics (Basel). 2021 Mar 26;10(4):352. doi: 10.3390/antibiotics10040352. Antibiotics (Basel). 2021. PMID: 33810449 Free PMC article.
-
Bacterial genome size and gene functional diversity negatively correlate with taxonomic diversity along a pH gradient.Nat Commun. 2023 Nov 17;14(1):7437. doi: 10.1038/s41467-023-43297-w. Nat Commun. 2023. PMID: 37978289 Free PMC article.
References
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Research Materials