Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Dec 21;8(6):e0072423.
doi: 10.1128/msystems.00724-23. Epub 2023 Nov 2.

Informed interpretation of metagenomic data by StrainPhlAn enables strain retention analyses of the upper airway microbiome

Affiliations

Informed interpretation of metagenomic data by StrainPhlAn enables strain retention analyses of the upper airway microbiome

Nadja Mostacci et al. mSystems. .

Abstract

The usage of 16S rRNA gene sequencing has become the state-of-the-art method for the characterization of the microbiota in health and respiratory disease. The method is reliable for low biomass samples due to prior amplification of the 16S rRNA gene but has limitations as species and certainly strain identification is not possible. However, the usage of metagenomic tools for the analyses of microbiome data from low biomass samples is not straight forward, and careful optimization is needed. In this work, we show that by validating StrainPhlAn 3 results with the data from bacterial cultures, the strain-level tracking of the respiratory microbiome is feasible despite the high content of host DNA being present when parameters are carefully optimized to fit low biomass microbiomes. This work further proposes that strain retention analyses are feasible, at least for more abundant species. This will help to better understand the longitudinal dynamics of the upper respiratory microbiome during health and disease.

Keywords: bacterial culture; genome analysis; metagenomics; respiratory tract; strain resolution.

PubMed Disclaimer

Conflict of interest statement

L.S., S.K., and O.S. are employees of Société des Produits Nestlé (SPN). P.L. reports grants (from Vertex and OM Pharma), lecture fees (from Vertex, Vifor, and OM Pharma), and participation on data and safety monitoring boards or advisory boards (for Polyphor, Santhera, Vertex, OM Pharma, Vifor, and Sanofi Avenits), outside the submitted work. M.H. reports a grant (from Pfizer) and participation on advisory boards (for Pfizer and MSD), outside the submitted work.

Figures

Fig 1
Fig 1
Benchmarking the different set of parameters. The default set of parameters is illustrated in run No. 0 and is BREADTH_THRESHOLD 80, TRIM_SEQUENCES 50, MARKER_IN_N_SAMPLES 80, and SAMPLE_WITH_N_MARKERS 20. Panels a, b, c, and d represent benchmarking for NP data; e and f for OP data. The default (No. 0) and 17 different sets of parameters (No. 1–17) are illustrated on the x-axis, and their characteristics are described in Table S1. Gray = specificity, yellow = sensitivity, and green = F1-score. The optimized sets of parameters are shown at No. 7 (indicated in red).
Fig 2
Fig 2
StrainPhlAn-derived trees from NP data sets. Trees are StrainPhAn 3 generated for (a) S. pneumoniae, (b) M. catarrhalis, (c) S. aureus. and (d) H. influenzae from NP data and were using the optimized parameters of StrainPhlAn 3. Samples are indicated in different colors depending on the presence/absence of culture data [i.e. purple = found in StrainPhlAn but not culture, yellow = found with StrainPhlAn and culture, and gray = found with StrainPhlAn but no culture data available (NA)].
Fig 3
Fig 3
Phylogenetic trees of S. aureus from OP data sets. On the left, core genome tree from WGS data of S. aureus isolates. On the right, the phylogenetic tree was created from metagenomic data of OP samples using the adjusted parameters of StraiPhlAn. Sample IDs are indicated in black.
Fig 4
Fig 4
Strain retention analyses based on normalized genetic distances of four different species of the NP data set. The all-versus-all normalized genetic distances have been separately calculated for S. pneumoniae, M. catarrhalis, H. influenzae, and S. aureus. Values were binned in intervals of 0.1. Bins with values 0.0–0.1 (for S. aureus and H. influenzae) and 0.0–0.2 (for S. pneumoniae and M. catarrhalis) were defined for strain retention (dotted lines; see text for details).
Fig 5
Fig 5
Clustering and longitudinal visualization of S. aureus strains of OP samples. Phylogenetic tree was created with StrainPhlAn using the new parameters from metagenomic data, highlighting 10 clusters (1-J) (Fig. 5A). Strain retention of S. aureus was investigated in a total of fifteen individuals with CF, sampled for up to the first 10 years of life (Fig. 5B). Strain retainment was investigated by (i) extracting the MLST from the WGS from S. aureus (SA) and (ii) using the clustering information from metagenomic data (shown as SA_clusters from A-J with different colors). For the detection of SA we used four categories. No S. aureus found by culture and metagenomic sequencing (indicated by an empty circle), S. aureus found by culture and metagenomic sequencing (filled triangle), S. aureus found by culture but not metagenomic sequencing (empty triangle), and S. aureus found by metagenomic sequencing but not by culture (filled circle). StrainPhlAn clusters are indicated with different colors. Numbers reflect the sequence types from MLST found in the respective S. aureus isolates. As for triangles without numbers, S. aureus by culture has been reported, but the isolate has not been kept for WGS.

References

    1. Ritchie AI, Singanayagam A. 2020. Metagenomic characterization of the respiratory microbiome. a piece de resistance. Am J Respir Crit Care Med 202:321–322. doi: 10.1164/rccm.202005-1686ED - DOI - PMC - PubMed
    1. Mika M, Korten I, Qi W, Regamey N, Frey U, Casaulta C, Latzin P, Hilty M, SCILD study group . 2016. The nasal microbiota in infants with cystic fibrosis in the first year of life: a prospective cohort study. Lancet Respir Med 4:627–635. doi: 10.1016/S2213-2600(16)30081-9 - DOI - PubMed
    1. de Koff EM, Groot KM de W, Bogaert D. 2016. Development of the respiratory tract microbiota in cystic fibrosis. Curr Opin Pulm Med 22:623–628. doi: 10.1097/MCP.0000000000000316 - DOI - PubMed
    1. Prevaes SMPJ, de Winter-de Groot KM, Janssens HM, de Steenhuijsen Piters WAA, Tramper-Stranders GA, Wyllie AL, Hasrat R, Tiddens HA, van Westreenen M, van der Ent CK, Sanders EAM, Bogaert D. 2016. Development of the nasopharyngeal microbiota in infants with cystic fibrosis. Am J Respir Crit Care Med 193:504–515. doi: 10.1164/rccm.201509-1759OC - DOI - PubMed
    1. Hilty M, Burke C, Pedro H, Cardenas P, Bush A, Bossley C, Davies J, Ervine A, Poulter L, Pachter L, Moffatt MF, Cookson WOC. 2010. Disordered microbial communities in asthmatic airways. PLoS One 5:e8578. doi: 10.1371/journal.pone.0008578 - DOI - PMC - PubMed

Substances