Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 May 11:2022:baac032.
doi: 10.1093/database/baac032.

met v1: expanding on old estimations of biodiversity from eDNA with a new database framework

Affiliations

met v1: expanding on old estimations of biodiversity from eDNA with a new database framework

David C Molik. Database (Oxford). .

Abstract

A long-standing problem in environmental DNA has been the inability to compute across large number of datasets. Here we introduce an open-source software framework that can store a large number of environmental DNA datasets, as well as provide a platform for analysis, in an easily customizable way. We show the utility of such an approach by analyzing over 1400 arthropod metabarcode datasets. This article introduces a new software framework, met, which utilizes large numbers of metabarcode datasets to draw conclusions about patterns of diversity at large spatial scales. Given more accurate estimations on the distribution of variance in metabarcode datasets, this software framework could facilitate novel analyses that are outside the scope of currently available similar platforms. Database URL https://osf.io/spb8v/.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
(A) Map of the 515 samples with latitude and longitude data. Samples tended to tightly cluster around locations, correlating with particular biodiversity assay experiments. (B) Number of sequences found per ASV, sorted by the number of ASVs found. If each ASV was counted across all datasets, it would necessitate an n2 operation of all sequences compared to all other sequences. Most analysis software have some solution to this all-on-all problem. met overcomes this difficulty by storing ASVs in a separate table so that this operation becomes a ‘n’ operation of grouping and counting the ASV’s associated datasets. The inferred ASV diversity followed an exponential function, with a substantially long tail. (C) Cumulative plot of any particular ASV found across samples. The plot is reverse sorted by count of samples in which the ASV is found. Although it may not look like it to the eye, no single sequence was found in over 20 datasets. (D) A diagram of met’s different pieces: met-api is composed of three major components: met-analysis, met-api and met-db. met-analysis is the main point of entry for the framework. Data gathered by crawlers would be inserted via met-analysis, and data for further downstream computation would come out of met-analysis. met-api is the only entry point for met-db, and met-db contains all information an analysis project may be interested in.

Similar articles

References

    1. Armitage D.W. (2017) Linking the development and functioning of a carnivorous pitcher plant’s microbial digestive community. ISME J., 11, 2439. doi: 10.1038/ismej.2017.99 - DOI - PMC - PubMed
    1. Buschmann T. and Bystrykh L.V. (2013) Levenshtein error-correcting barcodes for multiplexed DNA sequencing. BMC Bioinformatics., 14, 1–10. doi: 10.1186/1471-2105-14-272 - DOI - PMC - PubMed
    1. Caporaso J.G., Paszkiewicz K., Field D. et al. (2012) The western english channel contains a persistent microbial seed bank. ISME J., 6, 1089–1093. doi: 10.1038/ismej.2011.162 - DOI - PMC - PubMed
    1. Compeau P.E., Pevzner P.A. and Tesler G. (2011) How to apply de Bruijn graphs to genome assembly. Nat. Biotechnol., 29, 987–991. doi: 10.1038/nbt.2023 - DOI - PMC - PubMed
    1. Crits-Christoph A., Robinson C.K., Barnum T. et al. (2013) Colonization patterns of soil microbial communities in the Atacama Desert. Microbiome, 1, 28. doi: 10.1186/2049-2618-1-28 - DOI - PMC - PubMed

Publication types

Substances