LDAEXC: LncRNA-Disease Associations Prediction with Deep Autoencoder and XGBoost Classifier
- PMID: 37308797
- DOI: 10.1007/s12539-023-00573-z
Abstract
A large body of scientific evidence has revealed that long non-coding RNAs (lncRNAs) are involved in the progression of complex human diseases and in biological life activities. Therefore, identifying novel and potential disease-related lncRNAs is helpful for the diagnosis, prognosis and therapy of many complex human diseases. Since traditional laboratory experiments are costly and time-consuming, a large number of computational algorithms have been proposed for predicting the relationships between lncRNAs and diseases. However, there is still much room for improvement. In this paper, we introduce an accurate framework named LDAEXC to infer LncRNA-Disease Associations with a deep autoencoder and an XGBoost Classifier. LDAEXC utilizes different similarity views of lncRNAs and human diseases to construct features for each data source. The constructed feature vectors are then fed into a deep autoencoder to obtain reduced features, and finally an XGBoost classifier is used to calculate the latent lncRNA-disease association scores from the reduced features. Fivefold cross-validation experiments on four datasets showed that LDAEXC reached AUC scores of 0.9676 ± 0.0043, 0.9449 ± 0.022, 0.9375 ± 0.0331 and 0.9556 ± 0.0134, respectively, significantly higher than those of other advanced computational methods. Extensive experimental results and case studies of two complex diseases (colon and breast cancers) further indicated the practicability and excellent prediction performance of LDAEXC in inferring unknown lncRNA-disease associations.

LDAEXC utilizes disease semantic similarity, lncRNA expression similarity, and Gaussian interaction profile kernel similarity of lncRNAs and diseases for feature construction. The constructed features are fed to a deep autoencoder to extract reduced features, and an XGBoost classifier is used to predict the lncRNA-disease associations based on the reduced features.
Fivefold and tenfold cross-validation experiments on a benchmark dataset showed that LDAEXC achieved AUC scores of 0.9676 and 0.9682, respectively, significantly higher than those of other state-of-the-art methods.
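The abstract names Gaussian interaction profile kernel similarity as one of the similarity views but does not give its formula. The sketch below is a minimal illustration that assumes the commonly used definition, K(i, j) = exp(-γ‖IP(i) − IP(j)‖²) with γ set to γ′ divided by the mean squared norm of the interaction profiles. The function name `gip_kernel`, the parameter `gamma_prime`, and the toy association matrix are hypothetical and are not taken from LDAEXC itself.

```python
import math


def gip_kernel(profiles, gamma_prime=1.0):
    """Gaussian interaction profile kernel similarity between rows of `profiles`.

    Assumed definition (not confirmed by the abstract):
    K(i, j) = exp(-gamma * ||profiles[i] - profiles[j]||^2),
    gamma = gamma_prime / mean(||profiles[k]||^2).
    """
    n = len(profiles)
    # The mean squared norm of the profiles normalises the kernel bandwidth.
    mean_sq_norm = sum(sum(v * v for v in row) for row in profiles) / n
    if mean_sq_norm == 0:
        raise ValueError("all interaction profiles are empty")
    gamma = gamma_prime / mean_sq_norm

    sim = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i, n):
            sq_dist = sum((a - b) ** 2 for a, b in zip(profiles[i], profiles[j]))
            sim[i][j] = sim[j][i] = math.exp(-gamma * sq_dist)
    return sim


# Made-up association matrix: 3 lncRNAs (rows) x 4 diseases (columns).
associations = [
    [1, 0, 1, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 0],
]

# lncRNA similarity uses the rows; disease similarity uses the columns.
lnc_sim = gip_kernel(associations)
disease_sim = gip_kernel([list(col) for col in zip(*associations)])
```

In this toy example the first two lncRNAs share an identical interaction profile, so their similarity is 1, while lncRNAs with disjoint profiles receive a similarity closer to 0. How LDAEXC combines this view with the semantic and expression similarities is not stated in the abstract and is left out here.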
Keywords: Deep autoencoder; LncRNA–disease associations prediction; XGBoost classifier.
© 2023. International Association of Scientists in the Interdisciplinary Areas.