Interdiscip Sci. 2023 Sep;15(3):439-451.
doi: 10.1007/s12539-023-00573-z. Epub 2023 Jun 12.

LDAEXC: LncRNA-Disease Associations Prediction with Deep Autoencoder and XGBoost Classifier

Cuihong Lu et al. Interdiscip Sci. 2023 Sep.

Abstract

Substantial scientific evidence has revealed that long non-coding RNAs (lncRNAs) are involved in the progression of complex human diseases and in biological life activities. Therefore, identifying novel and potential disease-related lncRNAs is helpful for the diagnosis, prognosis and therapy of many complex human diseases. Since traditional laboratory experiments are costly and time-consuming, a large number of computational algorithms have been proposed for predicting the relationships between lncRNAs and diseases. However, there is still much room for improvement. In this paper, we introduce an accurate framework named LDAEXC to infer LncRNA-Disease Associations with a deep autoencoder and an XGBoost Classifier. LDAEXC utilizes different similarity views of lncRNAs and human diseases to construct features for each data source. Then, the reduced features are obtained by feeding the constructed feature vectors into a deep autoencoder, and finally an XGBoost classifier is used to calculate the latent lncRNA-disease association scores from the reduced features. Fivefold cross-validation experiments on four datasets showed that LDAEXC reached AUC scores of 0.9676 ± 0.0043, 0.9449 ± 0.022, 0.9375 ± 0.0331 and 0.9556 ± 0.0134, respectively, significantly higher than those of other advanced methods of the same kind. Extensive experimental results and case studies of two complex diseases (colon and breast cancers) further indicated the practicability and excellent prediction performance of LDAEXC in inferring unknown lncRNA-disease associations. LDAEXC utilizes disease semantic similarity, lncRNA expression similarity, and Gaussian interaction profile kernel similarity of lncRNAs and diseases for feature construction. The constructed features are fed to a deep autoencoder to extract reduced features, and an XGBoost classifier is used to predict the lncRNA-disease associations based on the reduced features.
Fivefold and tenfold cross-validation experiments on a benchmark dataset showed that LDAEXC achieved AUC scores of 0.9676 and 0.9682, respectively, significantly higher than those of other state-of-the-art methods.
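One of the similarity views named in the abstract, the Gaussian interaction profile (GIP) kernel similarity, can be illustrated with a short sketch. It follows the commonly used formulation in which each row of a binary association matrix is an interaction profile and the kernel bandwidth is normalized by the mean squared profile norm. The abstract does not give the exact settings used in LDAEXC, so the function name `gip_kernel_similarity`, the parameter `gamma_prime` and the toy matrix are assumptions made for illustration only.

```python
import math


def gip_kernel_similarity(association, gamma_prime=1.0):
    """GIP kernel similarity between the rows of a binary association matrix.

    `association` is a list of equal-length rows (one interaction profile per
    entity). `gamma_prime` is a hypothetical bandwidth parameter; 1.0 is a
    common default, not a value confirmed by the abstract.
    """
    n = len(association)
    # Squared Euclidean norm of every interaction profile.
    sq_norms = [sum(v * v for v in row) for row in association]
    mean_sq_norm = sum(sq_norms) / n
    if mean_sq_norm == 0:
        raise ValueError("association matrix contains no known associations")
    # Normalize the bandwidth by the average squared profile norm.
    gamma = gamma_prime / mean_sq_norm

    sim = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i, n):
            sq_dist = sum(
                (a - b) ** 2 for a, b in zip(association[i], association[j])
            )
            sim[i][j] = sim[j][i] = math.exp(-gamma * sq_dist)
    return sim


# Toy data (assumed): 3 lncRNAs (rows) x 4 diseases (columns).
toy = [
    [1, 0, 1, 0],
    [1, 0, 0, 0],
    [0, 1, 0, 1],
]

# lncRNA similarity uses the rows; disease similarity uses the columns.
lnc_sim = gip_kernel_similarity(toy)
dis_sim = gip_kernel_similarity([list(col) for col in zip(*toy)])
```

In a pipeline like the one described, similarity matrices of this kind would be combined with the other views to build a feature vector per lncRNA-disease pair before dimensionality reduction and classification; those later steps are not sketched here.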

Keywords: Deep autoencoder; LncRNA–disease associations prediction; XGBoost classifier.
