Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 May 19;25(1):400.
doi: 10.1186/s12887-025-05752-z.

Similarity of immune-associated markers in COVID-19 and Kawasaki disease: analyses from bioinformatics and machine learning

Affiliations

Similarity of immune-associated markers in COVID-19 and Kawasaki disease: analyses from bioinformatics and machine learning

Wang Li et al. BMC Pediatr. .

Abstract

Background: Infection by the SARS-CoV-2 virus can cause coronavirus disease 2019 (COVID-19) and can also exacerbate the symptoms of Kawasaki disease (KD), an acute vasculitis that mostly affects children. This study used bioinformatics and machine learning to examine similarities in the molecular pathogenesis of COVID-19 and KD.

Methods: We first identified disease-associated modules in KD using weighted gene co-expression network analysis. Then, we determined shared differentially expressed genes (DEGs) in training datasets for KD (GSE100154) and COVID-19 (GSE225220), performed functional annotation of these shared DEGs, and used Cytoscape plug-ins (MCODE and Cytohubba) to characterize the protein-protein interaction (PPI) network and identify the hub genes. We performed Least Absolute Shrinkage and Selection Operator(LASSO) regression and receiver operating characteristic (ROC) curve analysis to identify the most robust markers, validated these results by analysis of two other datasets (GSE73461 and GSE18606), and then calculated the correlations of these key genes with immune cells.

Results: This analysis identified 26 shared DEGs in COVID-19 and KD. The results from functional annotation showed that the shared DEGs primarily functioned in immune responses, the formation of neutrophil extracellular traps, and NOD-like receptor signaling pathways. There were three key genes (PGLYRP1, DEFA4, RETN), and they had positive correlations with monocytes, M0 macrophages, and dendritic cells, which function as immune infiltrating cells in KD.

Conclusion: The potential immune-associated biomarkers (PGLYRP1, DEFA4, RETN) along with their shared pathways, hold promise for advancing investigations into the underlying pathogenesis of KD and COVID-19.

Keywords: Bioinformatics analysis; COVID-19; Immune cell infiltration; Kawasaki disease (KD); Weighted gene co-expression network analysis (WGCNA).

PubMed Disclaimer

Conflict of interest statement

Declarations. Ethics approval and consent to participate: Not applicable. Consent for publication: Not applicable. Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Flow chart
Fig. 2
Fig. 2
WGCNA analysis of the GSE100154 dataset (KD vs. control). (a) Network topology for different values of the soft-threshold power, and selection of a β value of 18. (b) Gene clustering dendrogram, with different colors indicating distinct clusters. Thirteen modules had similar co-expression characteristics. (c) Heatmap for the correlation of different modules with disease phenotype. The pink and purple modules had positive correlations with KD, while the green-yellow and turquoise were negatively correlated
Fig. 3
Fig. 3
Identification of DEGs and in the GSE100154 dataset and the GSE22520 dataset. (a, b) Data normalization and volcano plot of DEGs in GSE100154(KD vs. control). (c, d) Data normalization and volcano plot of DEGs in GSE225220(COVID-19 vs. control). (adj. P < 0.05,|logFC|>1). (e) Shared genes in comparison of the WGCNA modules (n = 1047) and the DEGs in GSE100154 (n = 117). (f) Shared genes in comparison of the WGCNA modules and DEGs in GSE100154 (n = 56) and the DEGs in GSE225220 (n = 2711). logFC: log2(fold-change)
Fig. 4
Fig. 4
Functional enrichment analysis of COVID-19-related KD hub genes. (a) The top 5 GO terms related to three categories of hub genes (BP, biological process; CC, cellular component; MF, molecular function). (b) KEGG pathways related to the hub genes. Only highly enriched terms and pathways (P < 0.05) were retained
Fig. 5
Fig. 5
PPI network analysis of COVID-19-related KD hub genes. (a) PPI network of 26 hub genes. (b) Fifteen genes in the most representative module screened by the ‘MCODE’ plug-in. (c) Common hub genes identified from four algorithms of cytoHubba(MCC, EPC, Degree and MNC). (d) Network of the nine common hub genes. MCC: Maximal Clique Centrality, EPC: Edge Percolated Component, MNC: Maximum Neighborhood Component
Fig. 6
Fig. 6
Identification of the key COVID-19-related KD hub genes. (a, b) Screening of DEFA4, PGLYRP1, and RETN by LASSO regression analysis.The optimal lambda (lambda.1se) was 0.1563205 based on a 10-fold cross-validation. (c) ROC curves of DEFA4, PGLYRP1, RETN in the training dataset (GSE100154, KD vs. control). (d) ROC curves of DEFA4, PGLYRP1, and RETN in the validation dataset (GSE73461, KD vs. control). AUC: area under the curve
Fig. 7
Fig. 7
Validation of the expression of the key COVID-19-related KD hub genes. (a) Expression of DEFA4, PGLYRP1, and RETN in the GSE100154 dataset (KD vs. control). (b) Expression of DEFA4, PGLYRP1, and RETN in KD patients before and after IVIG treatment in the GSE18606 dataset. *p < 0.05; **p < 0.01; ***p < 0.001; ****p < 0.0001. logFC: log2(fold-change)
Fig. 8
Fig. 8
Relationship of immune cell infiltration and key COVID-19-related KD hub genes. (a) Proportions of 22 types of immune cells in each sample of the GSE100154 dataset (KD vs. control). (b) Proportions of 22 types of immune cells in each sample of the GSE225220 dataset (COVID-19 vs. control). (c) Infiltration status of immune cells in the two groups of the GSE100154 dataset. (d) Infiltration status of immune cells in the two groups of the GSE225220 dataset. (e) Correlation of DEFA4, PGLYRP1, and RETN with infiltration of different immune cells in the GSE100154 dataset. (f) Correlation of DEFA4, PGLYRP1, and RETN with infiltration of different immune cells in the GSE225220 dataset. *p < 0.05; **p < 0.01; ***p < 0.001

Similar articles

References

    1. Ozen S, et al. EULAR/PReS endorsed consensus criteria for the classification of childhood vasculitides. Ann Rheum Dis. 2006;65(7):936–41. 10.1136/ard.2005.046300. - PMC - PubMed
    1. Newburger JW, et al. Diagnosis, treatment, and long-term management of Kawasaki disease: a statement for health professionals from the committee on rheumatic fever, endocarditis and Kawasaki disease, Council on cardiovascular disease in the young. Am Heart Association Circulation. 2004;110(17):2747–71. 10.1161/01.cir.0000145143.19711.78. - PubMed
    1. McCrindle BW, et al. Diagnosis, treatment, and Long-Term management of Kawasaki disease: A scientific statement for health professionals from the American heart association. Circulation. 2017;135(17):e927–99. 10.1161/cir.0000000000000484. - PubMed
    1. Nakamura Y, et al. Epidemiologic features of Kawasaki disease in Japan: results of the 2009–2010 nationwide survey. J Epidemiol. 2012;22(3):216–21. 10.2188/jea.je20110126. - PMC - PubMed
    1. Harnden A, Tulloh R, Burgner D. Kawasaki disease. BMJ. 2014;349:g5336. 10.1136/bmj.g5336. - PubMed

LinkOut - more resources