Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Aug 29;22(1):802.
doi: 10.1186/s12967-024-05241-4.

Integration and validation of host transcript signatures, including a novel 3-transcript tuberculosis signature, to enable one-step multiclass diagnosis of childhood febrile disease

Affiliations

Integration and validation of host transcript signatures, including a novel 3-transcript tuberculosis signature, to enable one-step multiclass diagnosis of childhood febrile disease

Samuel Channon-Wells et al. J Transl Med. .

Abstract

Background: Whole blood host transcript signatures show great potential for diagnosis of infectious and inflammatory illness, with most published signatures performing binary classification tasks. Barriers to clinical implementation include validation studies, and development of strategies that enable simultaneous, multiclass diagnosis of febrile illness based on gene expression.

Methods: We validated five distinct diagnostic signatures for paediatric infectious diseases in parallel using a single NanoString nCounter® experiment. We included a novel 3-transcript signature for childhood tuberculosis, and four published signatures which differentiate bacterial infection, viral infection, or Kawasaki disease from other febrile illnesses. Signature performance was assessed using receiver operating characteristic curve statistics. We also explored conceptual frameworks for multiclass diagnostic signatures, including additional transcripts found to be significantly differentially expressed in previous studies. Relaxed, regularised logistic regression models were used to derive two novel multiclass signatures: a mixed One-vs-All model (MOVA), running multiple binomial models in parallel, and a full-multiclass model. In-sample performance of these models was compared using radar-plots and confusion matrix statistics.

Results: Samples from 91 children were included in the study: 23 bacterial infections (DB), 20 viral infections (DV), 14 Kawasaki disease (KD), 18 tuberculosis disease (TB), and 16 healthy controls. The five signatures tested demonstrated cross-platform performance similar to their primary discovery-validation cohorts. The signatures could differentiate: KD from other diseases with area under ROC curve (AUC) of 0.897 [95% confidence interval: 0.822-0.972]; DB from DV with AUC of 0.825 [0.691-0.959] (signature-1) and 0.867 [0.753-0.982] (signature-2); TB from other diseases with AUC of 0.882 [0.787-0.977] (novel signature); TB from healthy children with AUC of 0.910 [0.808-1.000]. Application of signatures outside of their designed context reduced performance. In-sample error rates for the multiclass models were 13.3% for the MOVA model and 0.0% for the full-multiclass model. The MOVA model misclassified DB cases most frequently (18.7%) and TB cases least (2.7%).

Conclusions: Our study demonstrates the feasibility of NanoString technology for cross-platform validation of multiple transcriptomic signatures in parallel. This external cohort validated performance of all five signatures, including a novel sparse TB signature. Two exploratory multi-class models showed high potential accuracy across four distinct diagnostic groups.

Keywords: Bacterial infection; Diagnostics; Gene expression; Kawasaki disease; Multiclass diagnostics; Tuberculosis; Viral infection.

PubMed Disclaimer

Conflict of interest statement

The authors declare the following filed patents and patent applications, relating to specific transcripts described in this manuscript: 2304229.4/GB/PRV, 23-03-2023; EP4200860A1/WO, 24-08-2021; CA3135429A1, 04-08-2018; ES2941905T3, 12-07-2017.

Figures

Fig. 1
Fig. 1
Approaches to Diagnostic testing. A Simple schematic approach to clinical diagnostics, with reliance on traditional microbiological testing. B Application of binary diagnostic signatures in parallel, demonstrating problems relating to overlapping or contradictory results. C Application of binary signatures in series, demonstrating carry-forward error in classification. D Application of multiclass signature, avoiding these limitations. Created with BioRender.com
Fig. 2
Fig. 2
Study overview. Schematic overview of study recruitment and analysis. Created with BioRender.com
Fig. 3
Fig. 3
Performance of existing signatures. Plots of Disease Risk Scores by category (left) and ROC-curves (right) for five signatures. A and B Wright13 signature, with boxplots of the DRS by category A and ROC curves of the DRS and LR-probability (B). C and D Herberg2 signature, with boxplots of the DRS by category C and ROC curves of the DRS, LR-probability, and individual transcripts (D). E and F Pennisi2 signature, with boxplots of the DRS by category (E) and ROC curves of the DRS, LR-probability, and individual transcripts (F). G and H TB3 signature, with boxplots of the DRS by category (G) and ROC curves of the DRS, LR-probability, and individual transcripts (H). I and J BATF2, with boxplots of expression by category (I) and ROC-curves of BATF2 expression tasked with differentiating active TB from either controls or other disease groups J. 95% confidence intervals for ROC-curves are included for the DRS and LR-probability only in panels B, D, F and H
Fig. 4
Fig. 4
TB3 signature performance in microarray training and validation cohorts. Boxplots of the DRS A of the TB3 signature and the correspondent ROC curves B in the training, test, and validation sets
Fig. 5
Fig. 5
In-sample radar plots for multiclass signatures. Radar plots showing the probabilities of each class predicted by A the MOVA-model, B the Multiclass model. Probabilities separated and coloured by actual disease: Red = DB; Blue = DV; Purple = TB; Yellow = KD

References

    1. Liu L, Oza S, Hogan D, et al. Global, regional, and national causes of under-5 mortality in 2000–15: an updated systematic analysis with implications for the sustainable development goals. Lancet. 2016;388(10063):3027–35. 10.1016/S0140-6736(16)31593-8 - DOI - PMC - PubMed
    1. Nijman RG, Jorgensen R, Levin M, Herberg J, Maconochie IK. Management of children with fever at risk for pediatric sepsis: a prospective study in pediatric emergency care. Front Pediatr. 2020;8:548154. 10.3389/fped.2020.548154 - DOI - PMC - PubMed
    1. Martinon-Torres F, Salas A, Rivero-Calle I, et al. Life-threatening infections in children in Europe (the EUCLIDS Project): a prospective cohort study. Lancet Child Adolesc Health. 2018;2(6):404–14. 10.1016/S2352-4642(18)30113-5 - DOI - PubMed
    1. Moore A, Harnden A, Mayon-White R. Recognising Kawasaki disease in UK primary care: a descriptive study using the clinical practice research datalink. Br J Gen Pract. 2014;64(625):e477–83. 10.3399/bjgp14X680953 - DOI - PMC - PubMed
    1. Lee JH, Garg T, Lee J, et al. Impact of molecular diagnostic tests on diagnostic and treatment delays in tuberculosis: a systematic review and meta-analysis. BMC Infect Dis. 2022;22(1):940. 10.1186/s12879-022-07855-9 - DOI - PMC - PubMed

Publication types

LinkOut - more resources