Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Feb 24;14(3):574.
doi: 10.3390/genes14030574.

An Entropy-Based Directed Random Walk for Cancer Classification Using Gene Expression Data Based on Bi-Random Walk on Two Separated Networks

Affiliations

An Entropy-Based Directed Random Walk for Cancer Classification Using Gene Expression Data Based on Bi-Random Walk on Two Separated Networks

Xin Hui Tay et al. Genes (Basel). .

Abstract

The integration of microarray technologies and machine learning methods has become popular in predicting the pathological condition of diseases and discovering risk genes. Traditional microarray analysis considers pathways as a simple gene set, treating all genes in the pathway identically while ignoring the pathway network's structure information. This study proposed an entropy-based directed random walk (e-DRW) method to infer pathway activities. Two enhancements from the conventional DRW were conducted, which are (1) to increase the coverage of human pathway information by constructing two inputting networks for pathway activity inference, and (2) to enhance the gene-weighting method in DRW by incorporating correlation coefficient values and t-test statistic scores. To test the objectives, gene expression datasets were used as input datasets while the pathway datasets were used as reference datasets to build two directed graphs. The within-dataset experiments indicated that e-DRW method demonstrated robust and superior performance in terms of classification accuracy and robustness of the predicted risk-active pathways compared to the other methods. In conclusion, the results revealed that e-DRW not only improved the prediction performance, but also effectively extracted topologically important pathways and genes that were specifically related to the corresponding cancer types.

Keywords: cancer classification; directed random walk; pathway-based analysis.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Workflow of e-DRW.
Figure 2
Figure 2
Comparison of classification performance.

References

    1. Xu P., Zhao G., Kou Z., Fang G., Liu W. Classification of cancers based on a comprehensive pathway activity inferred by genes and their interactions. IEEE Access. 2020;8:30515–30521. doi: 10.1109/ACCESS.2020.2973220. - DOI
    1. Liu W., Li C., Xu Y., Yang H., Yao Q., Han J., Shang D., Zhang C., Su F., Li X., et al. Topologically inferring risk-active pathways toward precise cancer classification by directed random walk. Bioinformatics. 2013;29:2169–2177. doi: 10.1093/bioinformatics/btt373. - DOI - PubMed
    1. Guo Z., Zhang T., Li X., Wang Q., Xu J., Yu H., Zhu J., Wang H., Wang C., Topol E.J., et al. Towards precise classification of cancers based on robust gene functional expression profiles. BMC Bioinform. 2005;6:58. doi: 10.1186/1471-2105-6-58. - DOI - PMC - PubMed
    1. Lee E., Chuang H.Y., Kim J.W., Ideker T., Lee D. Inferring pathway activity toward precise disease classification. PLoS Comput. Biol. 2008;4:e1000217. doi: 10.1371/journal.pcbi.1000217. - DOI - PMC - PubMed
    1. Su J., Yoon B.J., Dougherty E.R. Accurate and reliable cancer classification based on probabilistic inference of pathway activity. PLoS ONE. 2009;4:e8161. doi: 10.1371/journal.pone.0008161. - DOI - PMC - PubMed

Publication types