Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Sep 1:14:1206154.
doi: 10.3389/fendo.2023.1206154. eCollection 2023.

Identification of endoplasmic reticulum stress-related biomarkers of diabetes nephropathy based on bioinformatics and machine learning

Affiliations

Identification of endoplasmic reticulum stress-related biomarkers of diabetes nephropathy based on bioinformatics and machine learning

Jiaming Su et al. Front Endocrinol (Lausanne). .

Abstract

Backgrounds: Diabetes nephropathy (DN) is a growing public health concern worldwide. Renal dysfunction impairment in DN is intimately linked to ER stress and its related signaling pathways. Nonetheless, the underlying mechanism and biomarkers for this function of ER stress in the DN remain unknown.

Methods: Microarray datasets were retrieved from the Gene Expression Omnibus (GEO) database, and ER stress-related genes (ERSRGs) were downloaded from the MSigDB and GeneCards database. We identified hub ERSRGs for DN progression by intersecting ERSRGs with differentially expressed genes and significant genes in WGCNA, followed by a functional analysis. After analyzing hub ERSRGs with three machine learning techniques and taking the intersection, we did external validation as well as developed a DN diagnostic model based on the characteristic genes. Immune infiltration was performed using CIBERSORT. Moreover, patients with DN were then categorized using a consensus clustering approach. Eventually, the candidate ERSRGs-specific small-molecule compounds were defined by CMap.

Results: Several biological pathways driving pathological injury of DN and disordered levels of immune infiltration were revealed in the DN microarray datasets and strongly related to deregulated ERSRGs by bioinformatics multi-chip integration. Moreover, CDKN1B, EGR1, FKBP5, GDF15, and MARCKS were identified as ER stress signature genes associated with DN by machine learning algorithms, demonstrating their potential as DN biomarkers.

Conclusions: Our research sheds fresh light on the function of ER stress in DN pathophysiology and the development of early diagnostic and ER stress-related treatment targets in patients with DN.

Keywords: WGCNA (weighted gene co-expression network analysis); diabetic nephropathy; endoplasmic reticulum stress; immune cell infiltration; machine learning; molecular subtypes.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Figure 1
Identification of DEGs in glomeruli of DN patients. (A) Gene expression profiles without the removal of the batch effect. (B) Gene expression profiles with removal of batch effect. (C) Active GO functional enrichment of gene expression matrix in DN. (D) Active KEGG pathway of gene expression matrix in DN.
Figure 2
Figure 2
Construction of WGCNA co-expression network. (A) Sample clustering dendrogram with tree leaves corresponding to individual samples. (B) The screen of the best soft thresholds. Six was considered the best soft threshold. (C) The merging of similar modules. (D) Correlations between different modules and clinical traits. Red represents a positive correlation, and blue represents a negative correlation. (E) Clustering dendrogram of module feature genes. (F) Collinear heat map of module feature genes. Red color indicates a high correlation, blue color indicates opposite results. (G) The significance of genes related to DN in the brown, black, green and salmon module (a dot represents the genes in the module).
Figure 3
Figure 3
Identification of hub ERSRGs in the modules. (A) Venn diagram. (B) Heat map of hub ERSRGs. (C) GO enrichment analysis of hub ERSRGs. (D) KEGG enrichment analysis of hub ERSRGs.
Figure 4
Figure 4
The selection of characteristic genes of DN via machine learning algorithm. (A) Twenty-one characteristic genes of LASSO. (B) Nine characteristic genes of SVM-RFE. (C) Top twenty characteristic genes of RF. (D) Visualization of intersecting genes. (E) GBDT to verified the selected intersecting genes reliability.
Figure 5
Figure 5
Validation of characteristic genes in the gene chip datasets. (A) Correlation analysis of characteristic genes. (B) Representative violin plots present the expression of characteristic genes in the multi-chip dataset. (C) Representative violin plots present the expression of characteristic genes in tubulointerstitium dataset (GSE104954). (D) Representative violin plots present the expression of characteristic genes in kidney biopsy dataset (GSE142025). (E) Prediction model of nomogram. (F) The ROC curves for evaluating the diagnostic performance. *p-value < 0.05; **p-value < 0.01; ***p-value < 0.001.
Figure 6
Figure 6
Immune cell infiltration analysis. (A) Distribution of 22 kinds of immune cells in tissues of DN and control groups. (B) Correlation diagram between immune cells. (C) Expression of immune cells in DN and control groups. (D) Immune cells correlation with the expression of characteristic genes.
Figure 7
Figure 7
Identification of ER stress-associated molecular patterns in DN. (A) Consensus clustering matrix when k = 2. (B) Representative CDF curves when k = 2 to 6. (C) Relative alterations in CDF delta area curves. (D) Consensus score in each subtype when k = 2 to 6. (E) PCA analysis demonstrates that the DN patients are classified into two distinct subtypes.
Figure 8
Figure 8
The different immune characteristics and molecular mechanisms between two subtypes. (A) Vio plots showing the mRNA expression of characteristic genes in two ER stress subtypes. *p-value < 0.05; **p-value < 0.01; ***p-value < 0.001. (B) Vio plots demonstrating the infiltration levels of immune cell components in two ER stress subtypes. (C) Differences in enriched biological functions between ER stress subtypes ranked by t value of GSVA score. (D) Differences in the enriched hallmark pathways between ER stress subtypes ranked by t value of GSVA score.

Similar articles

Cited by

References

    1. Oshima M, Shimizu M, Yamanouchi M, Toyama T, Hara A, Furuichi K, et al. . Trajectories of kidney function in diabetes: a clinicopathological update. Nat Rev Nephrol (2021) 17:740–50. doi: 10.1038/s41581-021-00462-y - DOI - PubMed
    1. Sun H, Saeedi P, Karuranga S, Pinkepank M, Ogurtsova K, Duncan BB, et al. . IDF diabetes atlas: global, regional and country-level diabetes prevalence estimates for 2021 and projections for 2045. Diabetes Res Clin Pract (2022) 183:109119. doi: 10.1016/j.diabres.2021.109119 - DOI - PMC - PubMed
    1. GBD Chronic Kidney Disease Collaboration . Global, regional, and national burden of chronic kidney disease, 1990-2017: a systematic analysis for the global burden of disease study 2017. Lancet (2020) 395:709–33. doi: 10.1016/S0140-6736(20)30045-3 - DOI - PMC - PubMed
    1. Keri KC, Samji NS, Blumenthal S. Diabetic nephropathy: newer therapeutic perspectives. J Community Hosp Intern Med Perspect (2018) 8:200–07. doi: 10.1080/20009666.2018.1500423 - DOI - PMC - PubMed
    1. Dewanjee S, Bhattacharjee N. MicroRNA: a new generation therapeutic target in diabetic nephropathy. Biochem Pharmacol (2018) 155:32–47. doi: 10.1016/j.bcp.2018.06.017 - DOI - PubMed

Publication types