Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Mar 11;13(3):635.
doi: 10.3390/microorganisms13030635.

Interpreting Microbial Species-Area Relationships: Effects of Sequence Data Processing Algorithms and Fitting Models

Affiliations

Interpreting Microbial Species-Area Relationships: Effects of Sequence Data Processing Algorithms and Fitting Models

Fu-Liang Qi et al. Microorganisms. .

Abstract

In the study of Species-Area Relationships (SARs) in microorganisms, outcome discrepancies primarily stem from divergent high-throughput sequencing data processing algorithms and their combinations with different fitting models. This paper investigates the impacts and underlying causes of using diverse sequence data processing algorithms in microbial SAR studies, as well as compatibility issues that arise between different algorithms and fitting models. The findings indicate that the balancing strategies employed by different algorithms can result in variations in the calculations of alpha and beta diversity, thereby influencing the SARs of microorganisms. Crucially, incompatibilities exist between algorithms and models, with no consistently optimal combination identified. Based on these insights, we recommend prioritizing the use of the DADA2 algorithm in conjunction with a power model, which demonstrates greater compatibility. This study serves as a comprehensive comparison and reference for fundamental methods in microbial SAR research. Future microbial SAR studies should carefully select the most appropriate algorithms and models based on specific research objectives and data structures.

Keywords: high-throughput sequencing; microbial diversity patterns; model fitting; species–area relationship.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
SAR curves under different algorithms.
Figure 2
Figure 2
The slopes of the SAR curves under four different algorithms. The slope of the SAR curve, derived from fitting the power model, was analyzed along with p-values obtained through a Kruskal–Wallis test.
Figure 3
Figure 3
Comparison of alpha diversity and beta diversity among four algorithms. (a) displays the overall rarefaction curves for the four algorithms based on species richness; (b) illustrates the overall species abundance rank curve for the four algorithms; (c) presents a beta diversity box plot comparing samples across the four algorithms, with p-values derived from a Kruskal–Wallis test indicating differences; (d) conducts a beta diversity partitioning analysis on eight sample datasets from the four algorithms, where each black dot represents the comparison value between two samples. The positions of the dots are determined by the Richness Difference, Replacement, and Similarity, with the sum of each triplet equaling one. Larger black dots indicate the centroids of the points, representing the average values of Richness Difference, Replacement, and Similarity.
Figure 4
Figure 4
Comparison and verification of algorithm and model fitting results. (a) A scatter plot of AICc values and R2 for four algorithms fitted with 20 models, where data with R2 values less than zero are considered invalid. The p-value is calculated using the Kruskal–Wallis method to test for differences. (b) A scatter plot of AICc values and R2 for the best models across the four algorithms; different colors represent different algorithms, while different shapes denote various models. The p-value is also calculated using the Kruskal–Wallis method to test for differences. (c) The frequency of the best models under the four algorithms, where larger bubbles indicate a greater number of optimal fitting occurrences for each model.

References

    1. Macarthur R.H., Wilson E.O. An Equilibrium Theory of Insular Zoogeography. Evol. Int. J. Org. Evol. 1963;17:373–387. doi: 10.2307/2407089. - DOI
    1. Lomolino M.V. Ecology’s most general, yet protean pattern: The species-area relationship. J. Biogeogr. 2000;27:17–26. doi: 10.1046/j.1365-2699.2000.00377.x. - DOI
    1. Tjørve E., Tjørve K.M. Encyclopedia of Life Sciences. Wiley; New York, NY, USA: 2017. Species–Area Relationship; pp. 1–9. - DOI
    1. Martiny J.B.H., Bohannan B.J.M., Brown J.H., Colwell R.K., Fuhrman J.A., Green J.L., Horner-Devine M.C., Kane M., Krumins J.A., Kuske C.R., et al. Microbial biogeography: Putting microorganisms on the map. Nat. Rev. Microbiol. 2006;4:102–112. doi: 10.1038/nrmicro1341. - DOI - PubMed
    1. Zhou J., Ning D. Stochastic Community Assembly: Does It Matter in Microbial Ecology? Microbiol. Mol. Biol. Rev. 2017;81 doi: 10.1128/MMBR.00002-17. - DOI - PMC - PubMed

LinkOut - more resources