Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jun 10;15(1):4949.
doi: 10.1038/s41467-024-49245-6.

Circulating cell-free RNA in blood as a host response biomarker for detection of tuberculosis

Affiliations

Circulating cell-free RNA in blood as a host response biomarker for detection of tuberculosis

Adrienne Chang et al. Nat Commun. .

Abstract

Tuberculosis (TB) remains a leading cause of death from an infectious disease worldwide, partly due to a lack of effective strategies to screen and triage individuals with potential TB. Whole blood RNA signatures have been tested as biomarkers for TB, but have failed to meet the World Health Organization's (WHO) optimal target product profiles (TPP). Here, we use RNA sequencing and machine-learning to investigate the utility of plasma cell-free RNA (cfRNA) as a host-response biomarker for TB in cohorts from Uganda, Vietnam and Philippines. We report a 6-gene cfRNA signature, which differentiates TB-positive and TB-negative individuals with AUC = 0.95, 0.92, and 0.95 in test, training and validation, respectively. This signature meets WHO TPPs (sensitivity: 97.1% [95% CI: 80.9-100%], specificity: 85.2% [95% CI: 72.4-100%]) regardless of geographic location, sample collection method and HIV status. Overall, our results identify plasma cfRNA as a promising host response biomarker to diagnose TB.

PubMed Disclaimer

Conflict of interest statement

A. Chang, C.J.L, D.E.L, and I.D.V are inventors on submitted patents pertaining to cell-free nucleic acids (US patent applications 63/237,367 and 63/429,733). I.D.V. is a member of the Scientific Advisory Board of Karius Inc., Kanvas Biosciences, and GenDX. I.D.V. is listed as an inventor on submitted patents pertaining to cell-free nucleic acids (US patent applications 63/237,367, 63/056,249, 63/015,095, 16/500,929, 41614P-10551-01-US) and receives consulting fees from Eurofins Viracor. The remaining authors declare no competing interests.

Figures

Fig. 1
Fig. 1. Plasma cell-free RNA profiling.
A Geographic distribution of samples included in this study, which originated from three different cohorts. B Differences in cfRNA cell-types-of-origin between the three cohorts. Light-blue = Platelet, red = Myeloid-progenitor, yellow = B cell, dark-green = Endothelial cell, maroon = NK cell, dark-blue = Monocyte, light-green = Neutrophil, beige = Other. C VST normalized counts of significantly differentially abundant genes (two sided Wald test, Benjamini-Hochberg adjusted p-value < 0.05). Samples and genes are clustered based on correlation. Country: dark-green = Uganda, yellow = Vietnam, red = Philippines; Cohort: light-blue = Cohort 1, purple = Cohort 2, gold = Cohort 3; HIV status: orange = positive, maroon = negative; TB status: pink = positive, dark-blue = negative. D Top 21 differential pathways between microbiologically confirmed TB diagnoses ranked by significance. Z-score indicated in corresponding bar; direction relative to TB negative samples. E Flowchart of the method used to train, test, and validate the machine learning classification algorithms. F Area under the receiver operating characteristic curve (ROC-AUC) metrics for training and test sets across 15 machine learning models (test set = triangle, training set = circle).
Fig. 2
Fig. 2. Performance of the 6-gene signature identified in plasma cfRNA.
A Area under the receiver operating characteristic curve (ROC-AUC) as a function of the gene is added in each iteration of a greedy forward search model performed on the training dataset. B Train (dark-green), test (purple), and validation (gold) performance of the greedy forward search algorithm in distinguishing microbiologically confirmed TB. C Violin plot of classifier scores for training, test, and validation sets using the greedy forward search algorithm (TB positive = pink, TB negative = blue; HIV positive = triangle, HIV negative = circle). The dashed line represents the optimal Youden threshold cutoff (0.498) which was determined solely on the training set. D VST normalized counts of the most significant predictor GBP5 across both cohort and semiquantitative TB status (blue = negative, dark-green = low, yellow = medium, red = high). Boxes in the boxplots indicate the 25th and 75th percentiles, the band in the box represents the median, and whiskers extend to 1.5 x interquartile range of the hinge. E Performance of the 6-gene TB score when the model is re-evaluated without HIV-positive individuals. F Performance of the 6-gene TB score when separating samples by country (red = Uganda, green = Vietnam, light-blue = Philippines). G Correlation of the 6-gene TB score with the Xpert Ultra Semi-quantitative Result (dotted line: classification score threshold = 0.498; center: mean; bars indicate 95% confidence interval +/− SEM). Light-blue = Cohort 1, brown = Cohort 2, gold = Cohort 3. H Correlation of the 6-gene TB score with three chest X-ray scores. Color indicates disease status (pink = TB positive; blue = TB negative) Error bands represent the 95% CI around the LOESS smoothed line.
Fig. 3
Fig. 3. Comparison of the 6-gene plasma signature with whole blood signatures of TB from literature.
A Overlap between top-performing whole blood signatures (dark-blue) and the 6-gene signature (pink). Top 29 overlapping genes are shown. B Performance comparison of whole blood signatures from previous multi-cohort studies (dark-blue) with our 6-gene cfRNA signature (pink). Optimal triage thresholds are marked in light-blue (95% sensitivity, 80% specificity), minimal triage thresholds are marked in green (90% sensitivity, 70% specificity). Error bars represent the 95% confidence intervals. Samples sizes (n) are as follows: cfRNA (61), Berry86 (116), daCosta2 (54), daCosta3 (54), Kaforou44 (103), Walter47 (180), Zak16 (49), Sweeney3 (787). C Comparison of solid organ fraction between the 3 cfRNA cohorts (light-blue, brown, gold) and the whole blood samples (dark-blue). “****” represents a two-sided Wilcoxin test, Benjamini-Hochberg adjusted p-value < 1e-15. Boxes in the boxplots indicate the 25th and 75th percentiles, the band in the box represents the median, and whiskers extend to 1.5 × interquartile range of the hinge. D Venn diagram depicting the overlap of statistically significant, differentially expressed genes between whole blood and plasma cfRNA. E Performance of GBP5 cfRNA abundance in distinguishing active TB. F Performance comparison of whole blood protein GBP5, whole blood RNA GBP5, and plasma cfRNA GBP5 in distinguishing active TB.

Update of

References

    1. Chandra P, Grigsby SJ, Philips JA. Immune evasion and provocation by Mycobacterium tuberculosis. Nat. Rev. Microbiol. 2022;20:750–766. doi: 10.1038/s41579-022-00763-4. - DOI - PMC - PubMed
    1. Singhania A, Wilkinson RJ, Rodrigue M, Haldar P, O’Garra A. The value of transcriptomics in advancing knowledge of the immune response and diagnosis in tuberculosis. Nat. Immunol. 2018;19:1159–1168. doi: 10.1038/s41590-018-0225-9. - DOI - PMC - PubMed
    1. Drain PK. Incipient and Subclinical Tuberculosis: a Clinical Review of Early Stages and Progression of Infection. Clin. Microbiol. Rev. 2018;31:00021–18. doi: 10.1128/CMR.00021-18. - DOI - PMC - PubMed
    1. Mulenga H. Performance of diagnostic and predictive host blood transcriptomic signatures for Tuberculosis disease: A systematic review and meta-analysis. PLOS ONE. 2020;15:0237574. doi: 10.1371/journal.pone.0237574. - DOI - PMC - PubMed
    1. Zak DE. A blood RNA signature for tuberculosis disease risk: a prospective cohort study. Lancet Lond. Engl. 2016;387:2312–2322. doi: 10.1016/S0140-6736(15)01316-1. - DOI - PMC - PubMed