Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Dec 19:13:1010345.
doi: 10.3389/fimmu.2022.1010345. eCollection 2022.

Construction of prediction model of inflammation related genes in idiopathic pulmonary fibrosis and its correlation with immune microenvironment

Affiliations

Construction of prediction model of inflammation related genes in idiopathic pulmonary fibrosis and its correlation with immune microenvironment

Ying-Qiu Yin et al. Front Immunol. .

Abstract

Background: The role of inflammation in the formation of idiopathic pulmonary fibrosis (IPF) has gained a lot of attention recently. However, the involvement of genes related to inflammation and immune exchange environment status in the prognosis of IPF remains to be further clarified. The objective of this research is to establish a new model for the prediction of the overall survival (OS) rate of inflammation-related IPF.

Methods: Gene Expression Omnibus (GEO) was employed to obtain the three expression microarrays of IPF, including two from alveolar lavage fluid cells and one from peripheral blood mononuclear cells. To construct the risk assessment model of inflammation-linked genes, least absolute shrinkage and selection operator (lasso), univariate cox and multivariate stepwise regression, and random forest method were used. The proportion of immune cell infiltration was evaluated by single sample Gene Set Enrichment Analysis (ssGSEA) algorithm.

Results: The value of genes linked with inflammation in the prognosis of IPF was analyzed, and a four-genes risk model was constructed, including tpbg, Myc, ffar2, and CCL2. It was highlighted by Kaplan Meier (K-M) survival analysis that patients with high-risk scores had worse overall survival time in all training and validation sets, and univariate and multivariate analysis highlighted that it has the potential to act as an independent risk indicator for poor prognosis. ROC analysis showed that the prediction efficiency of 1-, 3-, and 5-year OS time in the training set reached 0.784, 0.835, and 0.921, respectively. Immune infiltration analysis showed that Myeloid-Derived Suppressor Cells (MDSC), macrophages, regulatory T cells, cd4+ t cells, neutrophils, and dendritic cells were more infiltrated in the high-risk group than in the low-risk group.

Conclusion: Inflammation-related genes can be well used to evaluate the IPF prognosis and impart a new idea for the treatment and follow-up management of IPF patients.

Keywords: idiopathic pulmonary fibrosis; immune microenvironment; inflammation; prognosis; ssGSEA.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Figure 1
Expression and enrichment analysis of inflammation-linked genes in IPF. (A) Heat map of the expression of genes linked with inflammation in the discovery set. (B) GO enrichment analysis of DEGs. (C) KEGG analysis of DEGs.
Figure 2
Figure 2
Identifying prognostic biomarkers. (A) Univariate Cox regression identifying genes associated with IPF prognosis. (B) Lasso regression further identifying prognosis-related genes. (C) Random Forest identifying the top 10 genes that are relatively important in prognosis. (D) Wayne diagram identifying the common genes selected by the two methods. (E) Multivariate stepwise Cox regression finally determined the prognosis-related genes.
Figure 3
Figure 3
Prognostic value of risk model in the training cohort. (A) K-M method was used to draw the survival curve according to risk score, and for comparison, a log-rank test was employed. (B) The distribution of risk score and survival status between high and low-risk groups. (C) To evaluate the differentiation between groups with high and low risk, PCA was utilized. (D) Through ROC analysis, the predictive effect of the risk model in the training queue was evaluated. (E) Univariate and multivariate Cox analysis. (F) C-index analysis was used to evaluate the prediction ability of the model.
Figure 4
Figure 4
The effectiveness of the risk model was verified in the validation set. (A) K-M method was used to draw the survival curve according to risk score, and for the comparison, the log-rank test was employed. (B) The distribution of risk score and survival status between high and low-risk groups. (C) To evaluate the differentiation between groups with high and low risk, PCA was utilized. (D) Through ROC analysis, the predictive role of the risk model in the validation queue was evaluated. (E) Univariate and multivariate Cox analysis. (F) C-index analysis was used to evaluate the prediction ability of the model.
Figure 5
Figure 5
The effectiveness of the risk model was verified in the validation set. (A) K-M method was used to draw the survival curve based on risk score, and for comparison, a log-rank test was employed. (B) The distribution of RS and survival status between high and low-risk groups. (C) To evaluate the differentiation between groups with high and low-risk, PCA was utilized. (D) Through ROC analysis, the predictive role of the risk model in the validation cohort was evaluated. (E) Univariate and multivariate Cox analysis. (F) for the evaluation of the prediction ability of the model, C-index analysis was employed.
Figure 6
Figure 6
Survival analysis of risk model in different subgroups. Survival analysis of risk models in the subgroup of age ≤ 65 (A) and greater than 65 (B). Survival analysis of risk models in female (C) and male (D).
Figure 7
Figure 7
Immune infiltration analysis. (A) In the discovery set, the immune cell infiltration between groups with high and low risk was evaluated based on the ssGSEA algorithm. (B) In the validation set, the immune cell infiltration between high-risk and low-risk groups was analyzed based on the ssGSEA algorithm ns, non significance; *p<0.05; **p<0.01; ***p<0.001.
Figure 8
Figure 8
Construction and evaluation of nomogram. (A) Nomogram was constructed based on the expression values of four genes to predict the OS rate at 1, 3, and 5 years. (B) The calibration curve was used to assess the nomogram.

References

    1. Martinez FJ, Collard HR, Pardo A, Raghu G, Richeldi L, Selman M, et al. . Idiopathic pulmonary fibrosis. Nat Rev Dis Primers (2017) 3:17074. doi: 10.1038/nrdp.2017.74 - DOI - PubMed
    1. Richeldi L, Collard HR, Jones MG. Idiopathic pulmonary fibrosis. Lancet (2017) 389(10082):1941–52. doi: 10.1016/S0140-6736(17)30866-8 - DOI - PubMed
    1. Olson AL, Gifford AH, Inase N, Fernandez Perez ER, Suda T. The epidemiology of idiopathic pulmonary fibrosis and interstitial lung diseases at risk of a progressive-fibrosing phenotype. Eur Respir Rev (2018) 27(150):180077. doi: 10.1183/16000617.0077-2018 - DOI - PMC - PubMed
    1. Molyneaux PL, Maher TM. The role of infection in the pathogenesis of idiopathic pulmonary fibrosis. Eur Respir Rev (2013) 22(129):376–81. doi: 10.1183/09059180.00000713 - DOI - PMC - PubMed
    1. Phan THG, Paliogiannis P, Nasrallah GK, Giordo R, Eid AH, Fois AG, et al. . Emerging cellular and molecular determinants of idiopathic pulmonary fibrosis. Cell Mol Life Sci (2021) 78(5):2031–57. doi: 10.1007/s00018-020-03693-7 - DOI - PMC - PubMed