Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 May 9;23(1):119.
doi: 10.1186/s12915-025-02221-y.

iPiDA-LGE: a local and global graph ensemble learning framework for identifying piRNA-disease associations

Affiliations

iPiDA-LGE: a local and global graph ensemble learning framework for identifying piRNA-disease associations

Hang Wei et al. BMC Biol. .

Abstract

Background: Exploring piRNA-disease associations can help discover candidate diagnostic or prognostic biomarkers and therapeutic targets. Several computational methods have been presented for identifying associations between piRNAs and diseases. However, the existing methods encounter challenges such as over-smoothing in feature learning and overlooking specific local proximity relationships, resulting in limited representation of piRNA-disease pairs and insufficient detection of association patterns.

Results: In this study, we propose a novel computational method called iPiDA-LGE for piRNA-disease association identification. iPiDA-LGE comprises two graph convolutional neural network modules based on local and global piRNA-disease graphs, aimed at capturing specific and general features of piRNA-disease pairs. Additionally, it integrates their refined and macroscopic inferences to derive the final prediction result.

Conclusions: The experimental results show that iPiDA-LGE effectively leverages the advantages of both local and global graph learning, thereby achieving more discriminative pair representation and superior predictive performance.

Keywords: Graph ensemble learning; Local context graph; piRNA-disease association identification.

PubMed Disclaimer

Conflict of interest statement

Declarations. Ethics approval and consent to participate: Not applicable. Consent for publication: Not applicable. Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Parameter analysis of local-level graph learning in iPiDA-LGE. a, b, c, and d illustrate the AUC and AUPR obtained by local-level graph learning module with different neighbor orders, epochs, learning rates, and GCN layers on Svalidation, respectively
Fig. 2
Fig. 2
Comparison between local/global graph learning and their combination. a shows the AUC and AUPR achieved by iPiDA-LGE with different fusion coefficients on Svalidation. b and c show the ROC and PR curves obtained by different predictors on Svalidation, respectively. df show the binary distribution of true labels and association scores predicted by iPiDA-L, iPiDA-G, and iPiDA-LGE on Svalidation
Fig. 3
Fig. 3
Analysis of features extracted by local and global graph learning. a shows the t-SNE visualization of features extracted by iPiDA-A, iPiDA-G, and iPiDA-L, respectively. b and c show local context graphs for two example positive and two example negative piRNA-disease pairs in Sindependent, respectively. d shows the heatmap of features extracted by iPiDA-G and iPiDA-L for four example piRNA-disease pairs
Fig. 4
Fig. 4
Comparison of different methods across four comprehensive metrics on 100 partitioned independent test sets. Wilcoxon rank-sum test is used to calculate the statistical difference between two groups of results. Comparisons with p values < 0.05 are marked with *, p values < 0.01 with **, p values < 0.001 with ***, and “ns” indicates no significant difference
Fig. 5
Fig. 5
The framework of iPiDA-LGE. There exist three primary processes: (i) Graph construction. The global graph is constructed and supplemented with piRNA sequences, disease ontology knowledge, and validated piRNA-disease relationships. In addition, the local context graph for each target pair is extracted from original bipartite graph. (ii) Local/global graph representation. The representations of piRNA and disease nodes are captured by global-level GCN, while the pair representations are obtained by local-level GCN. (iii) Association prediction. The dense layer and multi-layer perceptron are applied to reduce feature dimensionality and calculate association scores. Finally, the global-level and local-level association scores are integrated with different weight coefficients to predict the relationships between piRNAs and diseases
None
Algorithm 1. Local context graph extraction

Similar articles

References

    1. Seto AG, Kingston RE, Lau NC. The coming of age for Piwi proteins. Mol Cell. 2007;26(5):603–9. - PubMed
    1. Wang X, Ramat A, Simonelig M, Liu MF. Emerging roles and functional mechanisms of PIWI-interacting RNAs. Nat Rev Mol Cell Biol. 2023;24(2):123–41. - PubMed
    1. Guo X, Huang Z, Ju F, Zhao C, Yu L. Highly accurate estimation of cell type abundance in bulk tissues based on single-cell reference and domain adaptive matching. Adv Sci. 2024;11(7):2306329. - PMC - PubMed
    1. Zuo Y, Zou Q, Lin J, Jiang M, Liu X. 2lpiRNApred: a two-layered integrated algorithm for identifying piRNAs and their functions based on LFE-GM feature selection. RNA Biol. 2020;17(6):892–902. - PMC - PubMed
    1. Li YH, Xu JY, Tao L, Li XF, Li S, Zeng X, Chen SY, Zhang P, Qin C, Zhang C. SVM-Prot 2016: a web-server for machine learning prediction of protein functional families from sequence irrespective of similarity. PLoS ONE. 2016;11(8): e0155290. - PMC - PubMed

Substances

LinkOut - more resources