Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Dec 19;25(1):386.
doi: 10.1186/s12859-024-05999-w.

Prediction of miRNA-disease associations based on PCA and cascade forest

Affiliations

Prediction of miRNA-disease associations based on PCA and cascade forest

Chuanlei Zhang et al. BMC Bioinformatics. .

Abstract

Background: As a key non-coding RNA molecule, miRNA profoundly affects gene expression regulation and connects to the pathological processes of several kinds of human diseases. However, conventional experimental methods for validating miRNA-disease associations are laborious. Consequently, the development of efficient and reliable computational prediction models is crucial for the identification and validation of these associations.

Results: In this research, we developed the PCACFMDA method to predict the potential associations between miRNAs and diseases. To construct a multidimensional feature matrix, we consider the fusion similarities of miRNA and disease and miRNA-disease pairs. We then use principal component analysis(PCA) to reduce data complexity and extract low-dimensional features. Subsequently, a tuned cascade forest is used to mine the features and output prediction scores deeply. The results of the 5-fold cross-validation using the HMDD v2.0 database indicate that the PCACFMDA algorithm achieved an AUC of 98.56%. Additionally, we perform case studies on breast, esophageal and lung neoplasms. The findings revealed that the top 50 miRNAs most strongly linked to each disease have been validated.

Conclusions: Based on PCA and optimized cascade forests, we propose the PCACFMDA model for predicting undiscovered miRNA-disease associations. The experimental results demonstrate superior prediction performance and commendable stability. Consequently, the PCACFMDA is a potent instrument for in-depth exploration of miRNA-disease associations.

Keywords: Cascade forest; Ensemble learning; PCA; miRNA-disease Association.

PubMed Disclaimer

Conflict of interest statement

Declarations. Ethics approval and consent to participate: Not applicable. Consent for publication: Not applicable. Competing interests: The authors declare that they have no competing interest.

Figures

Fig. 1
Fig. 1
The structure of PCACFMDA
Fig. 2
Fig. 2
Cumulative explained variance by principal components
Fig. 3
Fig. 3
Decision-making processes in forest
Fig. 4
Fig. 4
The ROC and PR curves of PCACFMDA in 5-fold cross-validation
Fig. 5
Fig. 5
The ROC and PR curves of PCACFMDA in 10-fold cross-validation
Fig. 6
Fig. 6
The ROC and PR curves of different classifiers in 5-fold cross-validation
Fig. 7
Fig. 7
The ROC and PR curves of different models in 5-fold cross-validation
Fig. 8
Fig. 8
The illustration of heatmap based on esophageal neoplasms-related miRNAs
Fig. 9
Fig. 9
Kaplan-Meier plots of hsa-miR-198, hsa-miR-30a, hsa-miR-31, and hsa-let-7b for survival of patients with breast cancer

Similar articles

References

    1. Ambros V. The functions of animal micrornas. Nature. 2004;431(7006):350–5. - PubMed
    1. Egloff S, O’Reilly D, Murphy S. Expression of human snRNA genes from beginning to end. Biochem Soc Trans. 2008;36(4):590–4. - PubMed
    1. Wang L, You Z-H, Huang Y-A, Huang D-S, Chan KC. An efficient approach based on multi-sources information to predict circRNA-disease associations using deep convolutional neural network. Bioinformatics. 2020;36(13):4038–46. - PubMed
    1. Wang L, You Z-H, Huang D-S, Li J-Q. MGRCDA: metagraph recommendation method for predicting circRNA-disease association. IEEE Trans Cybernet. 2021;53(1):67–75. - PubMed
    1. Huang Y-A, Chen X, You Z-H, Huang D-S, Chan KC. ILNCSIM: improved lncRNA functional similarity calculation model. Oncotarget. 2016;7(18):25902. - PMC - PubMed

LinkOut - more resources