Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Oct 7:20:5535-5546.
doi: 10.1016/j.csbj.2022.10.004. eCollection 2022.

Recognition of driver genes with potential prognostic implications in lung adenocarcinoma based on H3K79me2

Affiliations

Recognition of driver genes with potential prognostic implications in lung adenocarcinoma based on H3K79me2

Lu-Qiang Zhang et al. Comput Struct Biotechnol J. .

Abstract

Lung adenocarcinoma is a malignancy with a low overall survival and a poor prognosis. Studies have shown that lung adenocarcinoma progression relates to locus-specific/global changes in histone modifications. To explore the relationship between histone modification and gene expression changes, we focused on 11 histone modifications and quantitatively analyzed their influences on gene expression. We found that, among the studied histone modifications, H3K79me2 displayed the greatest impact on gene expression regulation. Based on the Shannon entropy, 867 genes with differential H3K79me2 levels during tumorigenesis were identified. Enrichment analyses showed that these genes were involved in 16 common cancer pathways and 11 tumors and were target-regulated by trans-regulatory elements, such as Tp53 and WT1. Then, an open-source computational framework was presented (https://github.com/zlq-imu/Identification-of-potential-LUND-driver-genes). Twelve potential driver genes were extracted from the genes with differential H3K79me2 levels during tumorigenesis. The expression levels of these potential driver genes were significantly increased/decreased in tumor cells, as assayed by RT-qPCR. A risk score model comprising these driver genes was further constructed, and this model was strongly negatively associated with the overall survival of patients in different datasets. The proportional hazards assumption and outlier test indicated that this model could robustly distinguish patients with different survival rates. Immune analyses and responses to immunotherapeutic and chemotherapeutic agents showed that patients in the high and low-risk groups may have distinct tendencies for clinical selection. Finally, the regions with clear H3K79me2 signal changes on these driver genes were accurately identified. Our research may offer potential molecular biomarkers for lung adenocarcinoma treatment.

Keywords: Driver genes; Gene expression; H3K79me2; Lung adenocarcinoma.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

None
Graphical abstract
Fig. 1
Fig. 1
Correlation analysis of HM signals and gene expression. (A, C) Up-DEGs, (B, D) Down-DEGs. (A, B) HM signal changes (purple bars) and significant differences (pink bars) during LUAD tumorigenesis. The red lines in the 1st and 2nd circles correspond to the change ratio of HM signal = 1 and –log10(P value) = 2. (C, D) Spearman correlation coefficients between gene expression changes and HM signal changes in each of the 100 bins. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 2
Fig. 2
H3K79me2 signals are strongly correlated with gene expression levels in LUAD tested via random forest algorithm. (A) ROC curves for the top five HMs. (B) The prediction results tested by the RF, GLM and SVM algorithms and the top five vital bins for each HM obtained by the RF algorithm. (C) AUC distributions for HM combinations with the same number of HMs. (D) and (E) HM occurrence times in the studied 5-HM-based models and 6-HM-based models, respectively. (F) AUC distributions for 11 HMs in the same bin across the 100 bins around the TSS. (G) The ranks of 11 HMs in the same bin. Top-ranked HMs indicate higher ‘IncMSE’ values and greater effects on gene expression.
Fig. 3
Fig. 3
Enrichment analyses of the genes with differential H3K79me2 levels during oncogenesis. (A) The top 20 significant pathways in which the genes with differential H3K79me2 levels participated. (B) The interconnections of the 20 pathways described in A. (C) The top 20 diseases related to the genes with differential H3K79me2 levels and the corresponding number of genes in each disease. (D) The top 20 trans-regulatory elements and the number of genes regulated by these trans-regulatory elements.
Fig. 4
Fig. 4
Construction, evaluation and validation of the risk score model. (A) RT–qPCR analyses for the 12 PLDGs. The relative expression levels were calculated via2-ΔΔCt. (B and D) Kaplan–Meier survival curves for all-, high- and low-risk groups in the TCGA and GEO cohorts. (C and E) ROC curves for the risk score model in the TCGA and GEO databases. The distributions of the risk scores in LUAD patients with different (F) T-categories, (G) N-categories, (H) M-categories and (I) tumor stages. (J) Univariate and (K) multivariate Cox analyses for the risk score model and clinical characteristics.
Fig. 5
Fig. 5
The risk score model predicts the sensitivities of drug therapies. (A) Landscape of immune cell infiltrations in the high- and low-risk groups. Red and blue represent cells with higher and lower infiltration levels, respectively. *P < 0.05; **P < 0.01; ***P < 0.001; ns not significant. (B) Association between risk scores and immune cell infiltration (all P < 0.001). Immunophenoscores comparison between the high- and low-risk groups for LUAD patients treated with (C) CTLA4_neg_PD1_neg, (D) CTLA4_neg_PD‐1_pos, (E) CTLA4_ pos_PD‐1_neg, and (F) CTLA4_ pos_PD‐1_pos. PD‐1_pos or CTLA4_pos indicates anti‐PD‐1 or anti‐CTLA4 therapy, respectively. (G) The ratios of normalized IC50 values of the 112 drugs between the high- and low-risk groups. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 6
Fig. 6
The distributions of H3K79me2 signals on the 12 potential LUAD driver genes. The green and yellow histograms represent the average distributions of H3K79me2 in LUAD tumor cells and normal cells, respectively. The grey lines show the ratios of H3K79me2 signal changes. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)

Similar articles

Cited by

References

    1. Qiu M.T., Xia W.J., Chen R., Wang S.W., Xu Y.T., et al. The circular RNA circPRKCI promotes tumor growth in lung adenocarcinoma. Cancer Res. 2018;78(1):2839–2851. - PubMed
    1. Chen W.Q., Zheng R.S., Baade P.D., Zhang S.W., Zeng H.M., et al. Cancer statistics in China, 2015. Ca-Cancer J Clin. 2016;66:115–132. - PubMed
    1. Wu J., Li L., Zhang H.B., Zhao Y.Q., Zhang H.H., et al. A risk model developed based on tumor microenvironment predicts overall survival and associates with tumor immunity of patients with lung adenocarcinoma. Oncogene. 2021;40:4413–4424. - PubMed
    1. Wang Y.Y., Zheng D.F., Chen T.X., Zhang J., Yao F., et al. Survival prediction and adjuvant chemotherapy based on tumor marker for stage IB lung adenocarcinoma. Ann Thorac Surg. 2020;109:927–937. - PubMed
    1. Papikian A., Liu W., Gallego-Bartolome J., Jacobsen S.E. Site-specific manipulation of Arabidopsis loci using CRISPR-Cas9 SunTag systems. Nat Commun. 2019;10:729. - PMC - PubMed