Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Apr;25(3):e14056.
doi: 10.1111/1755-0998.14056. Epub 2024 Dec 30.

Teknonaturalist: A Snakemake Pipeline for Assessing Fungal Diversity From Plant Genome Bycatch

Affiliations

Teknonaturalist: A Snakemake Pipeline for Assessing Fungal Diversity From Plant Genome Bycatch

Nicholas W Bard et al. Mol Ecol Resour. 2025 Apr.

Abstract

Relatively little is known of the host associations and compatibility of fungal plant pathogens and endophytes. Publicly available plant genomic DNA can be mined to detect incidental fungal DNA, but taxonomic assignment can be challenging due to short lengths and variable discriminative power among different genomic regions and taxa. Here, we introduce a computationally lightweight and accessible Snakemake pipeline for rapid detection and classification (identification and assignment to taxonomic rank) of pathogenic and endophytic fungi (and other fungi associated with plants) that targets the internal transcribed spacer (ITS) region, a fungal barcode standard. We include methods for maximising query sequence length, which gives higher support for ITS1 and ITS2 taxonomic classifications by extending to other fragments of the ITS region and providing taxon-specific local cut-off and confidence scores. We demonstrate our pipeline with a case study using public genomic sequence data for six diverse plant species, including four species within Betula, an ecologically and economically important broadleaved forest tree genus, a shrub and a grass. Our pipeline classified fungi within minutes to a few hours per host individual, with 204 different fungal genera identified at high confidence (≥ 70%). Our pipeline detected and classified pathogenic and endophytic genera known to associate with Betula, and many others with no prior record of association. Our pipeline, leveraging existing sequence data, has several potential applications, including detecting cryptic fungal pathogens and helping characterise the endophytic fungal microbiome, bioprospecting commercially useful fungal species, and determining the plant host range of fungi.

Keywords: ITS; endophyte; fungi; genome bycatch; plant pathogens; plant‐fungal associations.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflicts of interest.

Figures

FIGURE 1
FIGURE 1
Pipeline overview. See text for full description of each step. These recommended steps can be performed using Snakemake for ease of implementation.
FIGURE 2
FIGURE 2
Fungal genera detected in three Betula taxa (Betula ermanii, B. pendula, B. platyphylla ), Buddleja alternifolia and Panicum virgatum. All phyla, and class, order, and family taxonomic groups with > 1 genus detected among samples, are labelled. Fungal genera are highlighted to indicate FUNGuild designation to endophyte or pathogen guilds. Black circles next to fungal genera indicate support from flanking region (ITS or 5.8S). At right, a heatmap of the plant host taxa with detected fungal genera relative frequencies shown.
FIGURE 3
FIGURE 3
PCoA of Sørenson dissimilarity across fungal communities (one or more taxa) detected from Betula ermanii, B. pendula, B. platyphylla, Buddleja alternifolia and Panicum virgatum . Different plant individuals are represented by blue circles. Plant taxa are represented by ‘spiders’.

Similar articles

References

    1. Abarenkov, K. , Zirk A., Piirman T., et al. 2022. “UNITE General FASTA Release for Fungi 2.” UNITE Community. 10.15156/BIO/2483912. - DOI
    1. Abarenkov, K. , Zirk A., Piirman T., et al. 2024. “Full UNITE+INSD Dataset for Fungi.” UNITE Community. 10.15156/BIO/2959330. - DOI
    1. Anagnostakis, S. L. 1987. “Chestnut Blight: The Classical Problem of an Introduced Pathogen.” Mycologia 79, no. 1: 23–37.
    1. Andrews, S. 2010. “FastQC a Quality‐Control Tool for High‐Throughput Sequence Data.” [Computer Software]. http://www.bioinformatics.babraham.ac.uk/projects/fastqc/.
    1. Antil, S. , Abraham J. S., Sripoorna S., et al. 2023. “DNA Barcoding, an Effective Tool for Species Identification: A Review.” Molecular Biology Reports 50, no. 1: 761–775. 10.1007/s11033-022-08015-7. - DOI - PubMed

LinkOut - more resources