Prediction of miRNA-disease associations based on PCA and cascade forest
- PMID: 39701957
- PMCID: PMC11660965
- DOI: 10.1186/s12859-024-05999-w
Prediction of miRNA-disease associations based on PCA and cascade forest
Abstract
Background: As a key non-coding RNA molecule, miRNA profoundly affects gene expression regulation and connects to the pathological processes of several kinds of human diseases. However, conventional experimental methods for validating miRNA-disease associations are laborious. Consequently, the development of efficient and reliable computational prediction models is crucial for the identification and validation of these associations.
Results: In this research, we developed the PCACFMDA method to predict the potential associations between miRNAs and diseases. To construct a multidimensional feature matrix, we consider the fusion similarities of miRNA and disease and miRNA-disease pairs. We then use principal component analysis(PCA) to reduce data complexity and extract low-dimensional features. Subsequently, a tuned cascade forest is used to mine the features and output prediction scores deeply. The results of the 5-fold cross-validation using the HMDD v2.0 database indicate that the PCACFMDA algorithm achieved an AUC of 98.56%. Additionally, we perform case studies on breast, esophageal and lung neoplasms. The findings revealed that the top 50 miRNAs most strongly linked to each disease have been validated.
Conclusions: Based on PCA and optimized cascade forests, we propose the PCACFMDA model for predicting undiscovered miRNA-disease associations. The experimental results demonstrate superior prediction performance and commendable stability. Consequently, the PCACFMDA is a potent instrument for in-depth exploration of miRNA-disease associations.
Keywords: Cascade forest; Ensemble learning; PCA; miRNA-disease Association.
© 2024. The Author(s).
Conflict of interest statement
Declarations. Ethics approval and consent to participate: Not applicable. Consent for publication: Not applicable. Competing interests: The authors declare that they have no competing interest.
Figures









Similar articles
-
MDHGI: Matrix Decomposition and Heterogeneous Graph Inference for miRNA-disease association prediction.PLoS Comput Biol. 2018 Aug 24;14(8):e1006418. doi: 10.1371/journal.pcbi.1006418. eCollection 2018 Aug. PLoS Comput Biol. 2018. PMID: 30142158 Free PMC article.
-
An improved random forest-based computational model for predicting novel miRNA-disease associations.BMC Bioinformatics. 2019 Dec 3;20(1):624. doi: 10.1186/s12859-019-3290-7. BMC Bioinformatics. 2019. PMID: 31795954 Free PMC article.
-
NEMPD: a network embedding-based method for predicting miRNA-disease associations by preserving behavior and attribute information.BMC Bioinformatics. 2020 Sep 10;21(1):401. doi: 10.1186/s12859-020-03716-x. BMC Bioinformatics. 2020. PMID: 32912137 Free PMC article.
-
EGBMMDA: Extreme Gradient Boosting Machine for MiRNA-Disease Association prediction.Cell Death Dis. 2018 Jan 5;9(1):3. doi: 10.1038/s41419-017-0003-x. Cell Death Dis. 2018. PMID: 29305594 Free PMC article.
-
MicroRNAs and complex diseases: from experimental results to computational models.Brief Bioinform. 2019 Mar 22;20(2):515-539. doi: 10.1093/bib/bbx130. Brief Bioinform. 2019. PMID: 29045685 Review.
References
-
- Ambros V. The functions of animal micrornas. Nature. 2004;431(7006):350–5. - PubMed
-
- Egloff S, O’Reilly D, Murphy S. Expression of human snRNA genes from beginning to end. Biochem Soc Trans. 2008;36(4):590–4. - PubMed
-
- Wang L, You Z-H, Huang Y-A, Huang D-S, Chan KC. An efficient approach based on multi-sources information to predict circRNA-disease associations using deep convolutional neural network. Bioinformatics. 2020;36(13):4038–46. - PubMed
-
- Wang L, You Z-H, Huang D-S, Li J-Q. MGRCDA: metagraph recommendation method for predicting circRNA-disease association. IEEE Trans Cybernet. 2021;53(1):67–75. - PubMed
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Research Materials