Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Dec 17;20(Suppl 20):637.
doi: 10.1186/s12859-019-3204-8.

Deconvoluting the diversity of within-host pathogen strains in a multi-locus sequence typing framework

Affiliations

Deconvoluting the diversity of within-host pathogen strains in a multi-locus sequence typing framework

Guo Liang Gan et al. BMC Bioinformatics. .

Abstract

Background: Bacterial pathogens exhibit an impressive amount of genomic diversity. This diversity can be informative of evolutionary adaptations, host-pathogen interactions, and disease transmission patterns. However, capturing this diversity directly from biological samples is challenging.

Results: We introduce a framework for understanding the within-host diversity of a pathogen using multi-locus sequence types (MLST) from whole-genome sequencing (WGS) data. Our approach consists of two stages. First we process each sample individually by assigning it, for each locus in the MLST scheme, a set of alleles and a proportion for each allele. Next, we associate to each sample a set of strain types using the alleles and the strain proportions obtained in the first step. We achieve this by using the smallest possible number of previously unobserved strains across all samples, while using those unobserved strains which are as close to the observed ones as possible, at the same time respecting the allele proportions as closely as possible. We solve both problems using mixed integer linear programming (MILP). Our method performs accurately on simulated data and generates results on a real data set of Borrelia burgdorferi genomes suggesting a high level of diversity for this pathogen.

Conclusions: Our approach can apply to any bacterial pathogen with an MLST scheme, even though we developed it with Borrelia burgdorferi, the etiological agent of Lyme disease, in mind. Our work paves the way for robust strain typing in the presence of within-host heterogeneity, overcoming an essential challenge currently not addressed by any existing methodology for pathogen genomics.

Keywords: Bacterial diversity; Integer Linear Programming; Multi-Locus Sequence Typing.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
A dataset with two samples and an MLST scheme of three loci (genes clpA, clpX, nifS). The strain type distributions require 5 different strains as the strain (clpA_1,clpX_1, nifS_7) appears in both distributions
Fig. 2
Fig. 2
Distribution of the number of existing and novel strains per tick sample
Fig. 3
Fig. 3
(Left) Cumulative proportion of the 10 existing strains in all 24 samples (within each bar, different colors represent different samples). (Right) Similar graph for the 60 novel strains

References

    1. Didelot X, Walker AS, Peto TE, Crook DW, Wilson DJ. Within-host evolution of bacterial pathogens. Nat Rev Microbiol. 2016;14(3):150–62. doi: 10.1038/nrmicro.2015.13. - DOI - PMC - PubMed
    1. Cadena AM, Fortune SM, Flynn JL. Heterogeneity in tuberculosis. Nat Rev Immunol. 2017;17:691. doi: 10.1038/nri.2017.69. - DOI - PMC - PubMed
    1. Tyler AD, Randell E, Baikie M, Antonation K, Janella D, Christianson S, Tyrrell GJ, Graham M, Van Domselaar G, Sharma MK. Application of whole genome sequence analysis to the study of Mycobacterium tuberculosis in Nunavut, Canada. PLoS ONE. 2017;12(10):0185656. doi: 10.1371/journal.pone.0185656. - DOI - PMC - PubMed
    1. Alizon Samuel, de Roode Jacobus C., Michalakis Yannis. Multiple infections and the evolution of virulence. Ecology Letters. 2013;16(4):556–567. doi: 10.1111/ele.12076. - DOI - PubMed
    1. Strandh Maria, Råberg Lars. Within-host competition between Borrelia afzelii ospC strains in wild hosts as revealed by massively parallel amplicon sequencing. Philosophical Transactions of the Royal Society B: Biological Sciences. 2015;370(1675):20140293. doi: 10.1098/rstb.2014.0293. - DOI - PMC - PubMed

LinkOut - more resources