Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jul 16.
doi: 10.1038/s41587-025-02738-1. Online ahead of print.

Comprehensive taxonomic identification of microbial species in metagenomic data using SingleM and Sandpiper

Affiliations

Comprehensive taxonomic identification of microbial species in metagenomic data using SingleM and Sandpiper

Ben J Woodcroft et al. Nat Biotechnol. .

Abstract

Determining the taxonomy and relative abundance of microorganisms in metagenomic data remains technically challenging. Here we present 'SingleM', which estimates community composition using conserved regions within universal marker genes. By accurately incorporating species lacking genomic representation, we show that unknown species dominate in most environmental microbial communities. Our website 'Sandpiper' collates microbial community profiles from 248,559 publicly available metagenomes.

PubMed Disclaimer

Conflict of interest statement

Competing interests: The authors declare no competing interests.

Similar articles

Cited by

References

    1. Meyer, F. et al. Critical Assessment of Metagenome Interpretation: the second round of challenges. Nat. Methods 19, 429–440 (2022). - PubMed - PMC
    1. Poussin, C. et al. Crowdsourced benchmarking of taxonomic metagenome profilers: lessons learned from the sbv IMPROVER Microbiomics challenge. BMC Genomics 23, 624 (2022). - PubMed - PMC
    1. Menzel, P., Ng, K. L. & Krogh, A. Fast and sensitive taxonomic classification for metagenomics with Kaiju. Nat. Commun. 7, 11257 (2016). - PubMed - PMC
    1. Buchfink, B., Reuter, K. & Drost, H.-G. Sensitive protein alignments at tree-of-life scale using DIAMOND. Nat. Methods 18, 366–368 (2021). - PubMed - PMC
    1. Parks, D. H. et al. GTDB: an ongoing census of bacterial and archaeal diversity through a phylogenetically consistent, rank normalized and complete genome-based taxonomy. Nucleic Acids Res. 50, D785–D794 (2022). - PubMed

LinkOut - more resources