Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Aug 4;12(1):13409.
doi: 10.1038/s41598-022-17266-0.

Game-theoretic link relevance indexing on genome-wide expression dataset identifies putative salient genes with potential etiological and diapeutics role in colorectal cancer

Affiliations

Game-theoretic link relevance indexing on genome-wide expression dataset identifies putative salient genes with potential etiological and diapeutics role in colorectal cancer

Vishwa Jyoti Baruah et al. Sci Rep. .

Abstract

Diapeutics gene markers in colorectal cancer (CRC) can help manage mortality caused by the disease. We applied a game-theoretic link relevance Index (LRI) scoring on the high-throughput whole-genome transcriptome dataset to identify salient genes in CRC and obtained 126 salient genes with LRI score greater than zero. The biomarkers database lacks preliminary information on the salient genes as biomarkers for all the available cancer cell types. The salient genes revealed eleven, one and six overrepresentations for major Biological Processes, Molecular Function, and Cellular components. However, no enrichment with respect to chromosome location was found for the salient genes. Significantly high enrichments were observed for several KEGG, Reactome and PPI terms. The survival analysis of top protein-coding salient genes exhibited superior prognostic characteristics for CRC. MIR143HG, AMOTL1, ACTG2 and other salient genes lack sufficient information regarding their etiological role in CRC. Further investigation in LRI methodology and salient genes to augment the existing knowledge base may create new milestones in CRC diapeutics.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
Distribution of LRI values of the salient genes. The figure exhibits the distribution of 126 salient genes. Violin plot (A) with jittered points data points and mean value, Q–Q plot (B) and histogram combined density plot (C) to show distribution LRI values of these 126 salient genes. (D) Comparison of the 124 salient genes (out of 126 Gene IDs, two Gene IDs did not map to Entrez ID) with 171 (after removing duplicates from the total 180 genes) unique cancer biomarkers from CellMarker database exhibits overlap of only two genes and 122 genes exhibits uniqueness. (E) Comparison of the 124 salient genes with 24 DEGs of CRC exhibits overlap of only two genes.
Figure 2
Figure 2
Enrichment of the major Biological Processes (BP) associated with the 126 salient genes. (A) Represents the network of various sub-ontologies, and associated genes, (B) describes percentage terms per group for various BP that are significantly enriched in pie chart and (C) number of genes in each term with significance sign. Node size is inversely proportioned to the p-value, i.e., the lower the value, the bigger the node size and color represent a different group of terms. *Significant at p ≤ 0.05, and **Significant at p ≤ 0.001.
Figure 3
Figure 3
Enrichment of the major Molecular Function (MF) (A) and Cellular component (CC) associated with the 126 salient genes. (A) Describes percentage terms per group for various MF that are significantly enriched, and (B) shows various sub-ontologies of CC and associated genes. Node size is inversely proportional to the p-value, i.e., the smaller the value more considerable the node size and colour represents a different group of terms.
Figure 4
Figure 4
Enrichment of the major KEGG pathways associated with the 126 LRI genes. (A) Represents the network of various pathways and sub-pathways and associated genes where the size of the node is inversely proportional to the p-value, i.e., the lower the p-value, the bigger the node size and colour represents a different group of pathway terms. (B) Describes percentage terms per group for various parent pathways significantly enriched in pie chart, and (C) describes the number of genes in each pathway and sub-pathway term with significance sign. *Significant at p ≤ 0.05, and **Significant at p ≤ 0.001.
Figure 5
Figure 5
Enrichment of the major Reactome pathways and reactions. The figure represents the Enrichment of the Reactome pathways (A) and reactions (B) associated with the 126 salient genes (Labeled in red colour). Node size is inversely proportional to the p-value, i.e., the lower the p-value, the bigger the node size and colour represents a different group of terms.
Figure 6
Figure 6
Protein–Protein interaction (PPI) network among the 126 salient genes. At a confidence score of 0.700, the figure exhibits major interactions among the protein-coding genes as node and various interactions as edges. The colored nodes are query proteins with 3D structures (if any) inside the node and edge color represents evidence as an indicator of interactions among the proteins. The isolated nodes were removed from the network.
Figure 7
Figure 7
Diapeutics implication of top 10 protein-coding salient genes in CRC. Kaplan–Meier (KM) survival analysis of overall survival with respect to expression of top 10 protein-coding salient genes in CRC samples. In each plot, the abscissa represents ‘Time in Years’ and the ordinate represent ‘Survival Probability’. Log-rank p-value for KM plot represents a significant correlation between mRNA expression level and patient survival by exhibiting significant differences in survival between genes’ high and low expression. Protein-coding salient genes exhibited statistically significant (log-rank p-value ≤ 0.05) in the overall survival endpoint (refer to Supplementary File Figure S1–S10 for enlarged view).

Similar articles

Cited by

References

    1. Sung H, et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 2021;71:209–249. - PubMed
    1. Guérin A, et al. Risk of developing colorectal cancer and benign colorectal neoplasm in patients with chronic constipation. Aliment. Pharmacol. Ther. 2014;40:83–92. doi: 10.1111/apt.12789. - DOI - PubMed
    1. Blasi VD, et al. Major hepatectomy for colorectal liver metastases in patients aged over 80: A propensity score matching analysis. Dig. Surg. 2018;35:333–341. doi: 10.1159/000486522. - DOI - PubMed
    1. Soliman AS, et al. Colorectal cancer in egyptian patients under 40 years of age. Int. J. Cancer. 1997;71:26–30. doi: 10.1002/(SICI)1097-0215(19970328)71:1<26::AID-IJC6>3.0.CO;2-5. - DOI - PubMed
    1. Redmond J, Vanderpool R, McClung R. Effectively communicating colorectal cancer screening information to primary care providers. Am. J. Health Educ. 2012;43:194–201. doi: 10.1080/19325037.2012.10599235. - DOI - PMC - PubMed

Publication types