Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2023 Dec 29:2023.12.28.23300451.
doi: 10.1101/2023.12.28.23300451.

Signatures of transmission in within-host M. tuberculosis variation

Affiliations

Signatures of transmission in within-host M. tuberculosis variation

Katharine S Walter et al. medRxiv. .

Update in

Abstract

Background: Because M. tuberculosis evolves slowly, transmission clusters often contain multiple individuals with identical consensus genomes, making it difficult to reconstruct transmission chains. Finding additional sources of shared M. tuberculosis variation could help overcome this problem. Previous studies have reported M. tuberculosis diversity within infected individuals; however, whether within-host variation improves transmission inferences remains unclear.

Methods: To evaluate the transmission information present in within-host M. tuberculosis variation, we re-analyzed publicly available sequence data from three household transmission studies, using household membership as a proxy for transmission linkage between donor-recipient pairs.

Findings: We found moderate levels of minority variation present in M. tuberculosis sequence data from cultured isolates that varied significantly across studies (mean: 6, 7, and 170 minority variants above a 1% minor allele frequency threshold, outside of PE/PPE genes). Isolates from household members shared more minority variants than did isolates from unlinked individuals in the three studies (mean 98 shared minority variants vs. 10; 0.8 vs. 0.2, and 0.7 vs. 0.2, respectively). Shared within-host variation was significantly associated with household membership (OR: 1.51 [1.30,1.71], for one standard deviation increase in shared minority variants). Models that included shared within-host variation improved the accuracy of predicting household membership in all three studies as compared to models without within-host variation (AUC: 0.95 versus 0.92, 0.99 versus 0.95, and 0.93 versus 0.91).

Interpretation: Within-host M. tuberculosis variation persists through culture and could enhance the resolution of transmission inferences. The substantial differences in minority variation recovered across studies highlights the need to optimize approaches to recover and incorporate within-host variation into automated phylogenetic and transmission inference.

Funding: NIAID: 5K01AI173385.

Keywords: M. tuberculosis; epidemiology; genomics; transmission.

PubMed Disclaimer

Conflict of interest statement

Declaration of interest The authors report no conflict of interest.

Figures

Figure 1.
Figure 1.. M. tuberculosis consensus genomes are closely related but are not always predictive of epidemiological linkage.
(a) Histograms indicate pairwise genetic distances between M. tuberculosis consensus genomes, with facets indicating study and pairwise comparison type. (b) Phylogeny of consensus sequences for each study, with branch tips colored to indicate samples from a single household or with known epidemiologic links.
Figure 2.
Figure 2.. Limited M. tuberculosis within-host diversity is recovered with culture-based Illumina sequencing.
Ridgeline plot of the minor allele frequency distribution for five randomly selected samples from each study, indicated by ridge color. Panels indicate genomic region: outside PE/PPE genes and within PE/PPE genes.
Figure 3.
Figure 3.. Pairwise shared variants above a 1% minor allele frequency.
Boxplots of the number of high-quality shared minority variants between sample pairs in three previously published M. tuberculosis transmission studies (columns) with jittered points indicating pairwise observations. Colors indicate comparison type: sample, within-host minority variants; household, minority variants shared between household pairs; unlinked, minority variants shared between individuals in different households. Boxes indicate group interquartile ranges and center lines indicate group medians.
Figure 4.
Figure 4.. Shared minority variants contain information about household membership.
(a) Stacked barplot showing the proportion of sample pairs across different levels of shared minority variants ≥ 1% minor allele frequency threshold. Panels indicate study. (b) ROC curves showing sensitivity (true positive rate) as a function 1 – specificity (true negative rate) for predicting household membership in general linear models that include both shared iSNVs and consensus sequence-based clusters (Full model), consensus sequence-based cluster only (Consensus sequences), and Shared iSNVs only (Shared iSNVs). All models include study as a predictor.

References

    1. Churchyard G. et al. What We Know about Tuberculosis Transmission: An Overview. Journal of Infectious Diseases 216, S629–S635 (2017). - PMC - PubMed
    1. Auld S. C. et al. Extensively drug-resistant tuberculosis in South Africa: Genomic evidence supporting transmission in communities. European Respiratory Journal 52, (2018). - PMC - PubMed
    1. Middelkoop K. et al. Transmission of tuberculosis in a south African community with a high prevalence of HIV infection. Journal of Infectious Diseases 211, 53–61 (2015). - PMC - PubMed
    1. Andrews J. R., Morrow C., Walensky R. P. & Wood R. Integrating social contact and environmental data in evaluating tuberculosis transmission in a South African township. J Infect Dis 210, 597–603 (2014). - PMC - PubMed
    1. Yates T. A. et al. The transmission of Mycobacterium tuberculosis in high burden settings. Lancet Infect Dis 16, 227–238 (2016). - PubMed

Publication types