Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Sep 10;121(37):e2403897121.
doi: 10.1073/pnas.2403897121. Epub 2024 Sep 6.

Plasma cell-free RNA signatures of inflammatory syndromes in children

Affiliations

Plasma cell-free RNA signatures of inflammatory syndromes in children

Conor J Loy et al. Proc Natl Acad Sci U S A. .

Abstract

Inflammatory syndromes, including those caused by infection, are a major cause of hospital admissions among children and are often misdiagnosed because of a lack of advanced molecular diagnostic tools. In this study, we explored the utility of circulating cell-free RNA (cfRNA) in plasma as an analyte for the differential diagnosis and characterization of pediatric inflammatory syndromes. We profiled cfRNA in 370 plasma samples from pediatric patients with a range of inflammatory conditions, including Kawasaki disease (KD), multisystem inflammatory syndrome in children (MIS-C), viral infections, and bacterial infections. We developed machine learning models based on these cfRNA profiles, which effectively differentiated KD from MIS-C-two conditions presenting with overlapping symptoms-with high performance [test area under the curve = 0.98]. We further extended this methodology into a multiclass machine learning framework that achieved 80% accuracy in distinguishing among KD, MIS-C, viral, and bacterial infections. We further demonstrated that cfRNA profiles can be used to quantify injury to specific tissues and organs, including the liver, heart, endothelium, nervous system, and the upper respiratory tract. Overall, this study identified cfRNA as a versatile analyte for the differential diagnosis and characterization of a wide range of pediatric inflammatory syndromes.

Keywords: cell-free RNA; diagnostics; inflammation; machine learning; pediatrics.

PubMed Disclaimer

Conflict of interest statement

Competing interests statement:C.J.L and I.D.V are inventors on submitted patents pertaining to cell-free nucleic acids (US patent applications 63/237,367 and 63/429,733). I.D.V. is listed as an inventor on submitted patents pertaining to cell-free nucleic acids (US patent applications 63/237,367, 63/056,249, 63/015,095, 16/500,929, 41614P-10551-01-US) and receives consulting fees from Eurofins Viracor. C.A.R. has received institutional support from ModernaTX, Inc., Pfizer Inc., BioFire Inc., GSK plc, MedImmune, Micron Technology Inc., Janssen Pharmaceuticals, Merck & Co., Inc., Novavax, PaxVax, Regeneron, and Sanofi Pasteur. She is co-inventor of patented RSV vaccine technology which has been licensed to Meissa Vaccines, Inc. C.Y.C. receives grant funding for research unrelated to this work from the Bay Area Lyme Disease Foundation, the Chan-Zuckerberg Biohub, and Abbott Laboratories, Inc.

Figures

Fig. 1.
Fig. 1.
Sample overview. (A) Sample counts and distribution of hospital of origin for each disease group. “Other” indicates other hospitalized pediatric controls (B) Age distribution, (C) C-reactive protein (CRP) levels, (D) sex distribution, and (E) ICU status distribution for each sample group.
Fig. 2.
Fig. 2.
Shared cfRNA signatures among different inflammation syndromes. (A) Differential abundance analysis using DESeq2 was performed between healthy controls and each other group individually. Vertical columns indicate the number of overlapping genes that are significantly differentially abundant between groups and controls (BH adjusted P-values < 0.05). Dots below bars indicate the groups being intersected. Horizontal columns indicate the total number of DAGs between groups and controls. (B) Significantly enriched pathways in the set of genes found to be differentially abundant between healthy controls and each disease group. Average P-value and fold change used for pathway analysis (Qiagen, IPA).
Fig. 3.
Fig. 3.
cfRNA distinguishes KD and MIS-C. (A) Overview of sample set and modeling scheme. (B) Volcano plot of differentially abundant transcripts between MIS-C and KD. Analysis was performed using the training dataset (DESeq2). (C) Adjusted P-value, base mean, and gene ROC AUC distributions for all significant genes from the training KD vs. MIS-C comparison. The red dashed line represents the threshold cutoff used for filtering prior to model training. The green shaded area indicates genes used. (D) ROC AUC values for the 14 machine learning classification models applied to training and validation sets. (E) ROC curves of the training, validation, and test sets using the GLMNET with LASSO regression algorithm. (F) Violin plots of the classifier scores from the GLMNET with LASSO regression algorithm.
Fig. 4.
Fig. 4.
Multiclass classification of pediatric disease using cfRNA. (A) Overview of machine learning framework used for multiclass classification. (B) ROC-AUC plots for each one-vs.-one model trained, along with train (Top) and test (Bottom) classifier score distributions. (C) Confusion matrix of reference and predicted diagnoses for train and test samples. Color indicates fraction of samples in each category.
Fig. 5.
Fig. 5.
cfRNA as a clinical decision support tool. (A) Median scaled cell type of origin fractions for samples separated by liver damage, cardiac function, COVID-19 severity, and endothelial damage, and healthy controls (Materials and Methods). Stars indicate statistically significant differences between groups in comparison (Wilcoxon rank-sum test, BH adjusted P-value < 0.05). (BE) Case studies of patients with example clinical decision support tool results from the multiclass algorithm, along with measurements of endothelium, heart, liver, lung, and neuronal damage from the deconvolution data. Sample shown in red and healthy donor samples in gray. Z-scores calculated relative to healthy donor sample distributions. Endothelium refers to endothelial cell, heart to cardiac muscle cells, liver to hepatocyte, lung to club cell and type I pneumocyte, and neuronal to Schwann cell cfRNA cell type of origin fractions.

Update of

References

    1. Hisamuddin E., Hisam A., Wahid S., Raza G., Validity of C-reactive protein (CRP) for diagnosis of neonatal sepsis. Pak. J. Med. Sci. 31, 527–531 (2015). - PMC - PubMed
    1. Tan M., Lu Y., Jiang H., Zhang L., The diagnostic accuracy of procalcitonin and C-reactive protein for sepsis: A systematic review and meta-analysis. J. Cell. Biochem. 120, 5852–5859 (2019). - PubMed
    1. Servellita V., et al. , A diagnostic classifier for gene expression-based identification of early Lyme disease. Commun. Med. 2, 92 (2022). - PMC - PubMed
    1. Habgood-Coote D., et al. , Diagnosis of childhood febrile illness using a multi-class blood RNA molecular signature. Med. 4, 635–654.e5 (2023). - PubMed
    1. Kalantar K. L., et al. , Integrated host-microbe plasma metagenomics for sepsis diagnosis in a prospective cohort of critically ill adults. Nat. Microbiol. 7, 1805–1816 (2022). - PMC - PubMed

MeSH terms

Supplementary concepts