Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Oct 7;15(10):e72242.
doi: 10.1002/ece3.72242. eCollection 2025 Oct.

Pruning the Tree: Comparing OTUs and ASVs in High-Throughput Sequencing of 5S-IGS Nuclear Ribosomal DNA in Phylogenetic Studies

Affiliations

Pruning the Tree: Comparing OTUs and ASVs in High-Throughput Sequencing of 5S-IGS Nuclear Ribosomal DNA in Phylogenetic Studies

Simone Cardoni et al. Ecol Evol. .

Abstract

Amplicon sequencing of the nuclear ribosomal 5S RNA gene arrays is highly promising for genotaxonomy, to delineate genetic resources of species and trace their evolution. However, the huge amount of data retrieved with this approach is difficult to manage and prone to redundancy, error, and computational difficulties. Reducing the amount of data per sample without losing molecular-phylogenetic signal is therefore a crucial step for downstream analyses. In this work, we compared operational taxonomic units (OTUs, 100% identity) and amplicon sequence variants (ASVs) from 5S intergenic spacer (5S-IGS) amplicons of seven beech species (Fagus spp.) obtained with two widely used and competing bioinformatics tools, MOTHUR and DADA2. We assessed quantitative (total number of sequences retrieved, proportion of sequences unique or shared across the different data sets) and qualitative (sample diversity, data ruggedness and congruence of the obtained phylogenies) differences among sample data profiles obtained with the two methods, and the capacity of the inferred phylogenies to capture diagnostic 5S-IGS variant types. Over 70% of processed reads were shared between OTUs and ASVs. Despite a strong reduction (> 80%) of the representative sequences, DADA2-ASVs identified all main 5S-IGS variants known for Fagus, reflecting the phylogenetic, taxonomic and diversity patterns expected for each sample. In contrast, MOTHUR generated large proportions of rare OTUs and ASVs that complicated the obtained phylogenies and were inference-wise redundant. We conclude that differences in the sequence variation detected by the two pipelines are minimal and do not result in different phylogenetic information. The more effective and computationally more efficient DADA2 ASVs may thus replace OTUs in future 5S-IGS studies dealing with complex bioecological phenomena such as hybridisation, polyploidisation, drift and inferring evolutionary pathways of species systems, especially when using increasingly large sample sets.

Keywords: 5S nrDNA; DADA2; Fagus; MOTHUR; high‐throughput sequencing; phylogenetic analysis.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflicts of interest.

Figures

FIGURE 1
FIGURE 1
Venn diagram depicting the number of representative sequences unique to or shared across the different pipelines of data analysis.
FIGURE 2
FIGURE 2
Full maximum likelihood trees for sample #04 (F. caspica). Main 5S‐IGS types are annotated, asterisks denote reference tips after Denk et al. (2024). Trees are ordered by increasing tip sample size reduction. ML trees of all other samples are provided as File S3.
FIGURE 3
FIGURE 3
Collapsed maximum likelihood trees inferred on the DADA2 ‘non‐pooled’ ASV data sets and rooted on the Lineage O variants for the six samples belonging to Fagus subgen. Fagus, with the main types annotated (following Denk et al. 2024). For comparison, pie charts showing the relative abundance of main 5S‐IGS sequence variants as defined and discussed by Denk et al. (2024) are provided. Fagus japonica tree available in File S3.
FIGURE 4
FIGURE 4
Neighbour‐nets based on intersample ‘phylogenetic Bray–Curtis’ distances, using as ‘associates’: (a, b) MOTHUR‐OTUs/‐ASVs with an abundance of ≥ 4 (a: OTUs; b: ASVs); (c–e) DADA2 ASVs (c: ‘Pooled’; d: ‘Pseudo‐pooled’; e: ‘Nonpooled’ options). Edge bundles of corresponding and expected neighbourhoods coloured accordingly, thick lines indicate neighbourhoods in line with known phylogenetic relationships. Grey line (b, c) refers to the edge bundle reflecting a split into Western Eurasian and East Asian species. All graphs have the same scale, terminal edge leading to sample #06, F. japonica (stippled), has been reduced by a factor of 2.
FIGURE 5
FIGURE 5
Map of the investigated beech samples with pie charts showing the relative abundance of main 5S‐IGS sequence variants; phylogenetic classification of MOTHUR‐OTUs (ab ≥ 4) and nonpooled ASVs based on the sample‐wise trees and EPA (cf. Cardoni et al. 2022).

References

    1. Antich, A. , Palacin C., Wangensteen O. S., and Turon X.. 2021. “To Denoise or to Cluster, That Is Not the Question: Optimizing Pipelines for COI Metabarcoding and Metaphylogeography.” BMC Bioinformatics 22: 1–24. 10.1186/s12859-021-04115-6. - DOI - PMC - PubMed
    1. Bakker, M. G. 2018. “A Fungal Mock Community Control for Amplicon Sequencing Experiments.” Molecular Ecology Resources 18: 541–556. 10.1111/1755-0998.12760. - DOI - PubMed
    1. Bálint, M. , Bahram M., Eren A. M., et al. 2016. “Millions of Reads, Thousands of Taxa: Microbial Community Structure and Associations Analyzed via Marker Genes.” FEMS Microbiology Reviews 40: 686–700. 10.1093/femsre/fuw017. - DOI - PubMed
    1. Baum, B. R. , Johnson D. A., and Bailey L. G.. 2001. “Defining Orthologous Groups Among Multicopy Genes Prior to Inferring Phylogeny, With Special Emphasis on the Triticeae (Poaceae).” Hereditas 135: 123–138. - PubMed
    1. Berger, S. A. , Krompass D., and Stamatakis A.. 2011. “Performance, Accuracy, and Web Server for Evolutionary Placement of Short Sequence Reads Under Maximum Likelihood.” Systematic Biology 60: 291–302. 10.1093/sysbio/syr010. - DOI - PMC - PubMed

LinkOut - more resources