Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2021 May 19;11(1):10590.
doi: 10.1038/s41598-021-89881-2.

An inter-laboratory study to investigate the impact of the bioinformatics component on microbiome analysis using mock communities

Affiliations
Comparative Study

An inter-laboratory study to investigate the impact of the bioinformatics component on microbiome analysis using mock communities

Denise M O'Sullivan et al. Sci Rep. .

Abstract

Despite the advent of whole genome metagenomics, targeted approaches (such as 16S rRNA gene amplicon sequencing) continue to be valuable for determining the microbial composition of samples. Amplicon microbiome sequencing can be performed on clinical samples from a normally sterile site to determine the aetiology of an infection (usually single pathogen identification) or samples from more complex niches such as human mucosa or environmental samples where multiple microorganisms need to be identified. The methodologies are frequently applied to determine both presence of micro-organisms and their quantity or relative abundance. There are a number of technical steps required to perform microbial community profiling, many of which may have appreciable precision and bias that impacts final results. In order for these methods to be applied with the greatest accuracy, comparative studies across different laboratories are warranted. In this study we explored the impact of the bioinformatic approaches taken in different laboratories on microbiome assessment using 16S rRNA gene amplicon sequencing results. Data were generated from two mock microbial community samples which were amplified using primer sets spanning five different variable regions of 16S rRNA genes. The PCR-sequencing analysis included three technical repeats of the process to determine the repeatability of their methods. Thirteen laboratories participated in the study, and each analysed the same FASTQ files using their choice of pipeline. This study captured the methods used and the resulting sequence annotation and relative abundance output from bioinformatic analyses. Results were compared to digital PCR assessment of the absolute abundance of each target representing each organism in the mock microbial community samples and also to analyses of shotgun metagenome sequence data. This ring trial demonstrates that the choice of bioinformatic analysis pipeline alone can result in different estimations of the composition of the microbiome when using 16S rRNA gene amplicon sequencing data. The study observed differences in terms of both presence and abundance of organisms and provides a resource for ensuring reproducible pipeline development and application. The observed differences were especially prevalent when using custom databases and applying high stringency operational taxonomic unit (OTU) cut-off limits. In order to apply sequencing approaches with greater accuracy, the impact of different analytical steps needs to be clearly delineated and solutions devised to harmonise microbiome analysis results.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
(AD) The % family abundance reported by each laboratory including the nominal and dPCR reported composition for the two materials; MCM2α, variable regions V1–2 (A) and V4–6 (B) and MCM2β, variable regions, V1–2 (C) and V4–6 (D).
Figure 2
Figure 2
(AC) NDMS (non-metric multidimensional scaling) plots to see if results from the 13 laboratories cluster according to OTU assignment tool (A), OTU assignment database (B) and taxa assignment database (C). OTU assignment database was included for laboratories that used closed reference OTU picking. They generally compared their sequences against a reference database of sequences that clustered the reads into OTUs based on sequence similarity. Later, many laboratories assigned taxonomic identifiers to each of these OTUs using a separate database which had sequence data (sometimes not clustered into OTUs) and which taxa that sequence originated from.

Similar articles

Cited by

References

    1. Kitsios GD, Fitch A, Manatakis DV, Rapport SF, Li K, Qin S, et al. Respiratory microbiome profiling for etiologic diagnosis of pneumonia in mechanically ventilated patients. Front. Microbiol. 2018;9(1413):1. - PMC - PubMed
    1. Amelia TSM, Amirul A-AA, Saidin J, Bhubalan K. Identification of cultivable bacteria from tropical marine sponges and their biotechnological potentials. Trop. Life Sci. Res. 2018;29(2):187–199. doi: 10.21315/tlsr2018.29.2.13. - DOI - PMC - PubMed
    1. Tringe SG, Hugenholtz P. A renaissance for the pioneering 16S rRNA gene. Curr. Opin. Microbiol. 2008;11(5):442–446. doi: 10.1016/j.mib.2008.09.011. - DOI - PubMed
    1. Martinez-Porchas M, Villalpando-Canchola E, Ortiz Suarez LE, Vargas-Albores F. How conserved are the conserved 16S-rRNA regions? Peer J. 2017;5:e3036. doi: 10.7717/peerj.3036. - DOI - PMC - PubMed
    1. Klindworth A, Pruesse E, Schweer T, Peplies J, Quast C, Horn M, et al. Evaluation of general 16S ribosomal RNA gene PCR primers for classical and next-generation sequencing-based diversity studies. Nucleic Acids Res. 2012;41(1):e1. doi: 10.1093/nar/gks808. - DOI - PMC - PubMed

Publication types

Substances

LinkOut - more resources