Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Feb 6;15(2):234.
doi: 10.3390/biom15020234.

Prediction of circRNA-Disease Associations via Graph Isomorphism Transformer and Dual-Stream Neural Predictor

Affiliations

Prediction of circRNA-Disease Associations via Graph Isomorphism Transformer and Dual-Stream Neural Predictor

Hongchan Li et al. Biomolecules. .

Abstract

Circular RNAs (circRNAs) have attracted increasing attention for their roles in human diseases, making the prediction of circRNA-disease associations (CDAs) a critical research area for advancing disease diagnosis and treatment. However, traditional experimental methods for exploring CDAs are time-consuming and resource-intensive, while existing computational models often struggle with the sparsity of CDA data and fail to uncover potential associations effectively. To address these challenges, we propose a novel CDA prediction method named the Graph Isomorphism Transformer with Dual-Stream Neural Predictor (GIT-DSP), which leverages knowledge graph technology to address data sparsity and predict CDAs more effectively. Specifically, the model incorporates multiple associations between circRNAs, diseases, and other non-coding RNAs (e.g., lncRNAs, and miRNAs) to construct a multi-source heterogeneous knowledge graph, thereby expanding the scope of CDA exploration. Subsequently, a Graph Isomorphism Transformer model is proposed to fully exploit both local and global association information within the knowledge graph, enabling deeper insights into potential CDAs. Furthermore, a Dual-Stream Neural Predictor is introduced to accurately predict complex circRNA-disease associations in the knowledge graph by integrating dual-stream predictive features. Experimental results demonstrate that GIT-DSP outperforms existing state-of-the-art models, offering valuable insights for precision medicine and disease-related research.

Keywords: circRNA–disease associations; graph isomorphism network; knowledge representation learning; transformer.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflicts of interest.

Figures

Figure 1
Figure 1
Flowchart of GIT-DSP. (A) Construct the CDA knowledge graph with multi-source heterogeneous datasets. (B) Construct a fused similarity network based on the association matrix. (C) Generate high-quality embeddings through attention-based information propagation with the Graph Isomorphism Transformer. (D) Predict CDAs.
Figure 2
Figure 2
Visualization of datasets. (AC) represent the association distributions of Dataset 1, Dataset 2, and Dataset 3, respectively. This contains five associations (lnc-dis (lncRNA–disease), lnc-mi (lncRNA–miRNA), mi-dis (miRNA–disease), circ-mi (circRNA–miRNA), and circ-dis (circRNA–disease)).
Figure 3
Figure 3
Association between CircRNA and diseases. (A) The horizontal axis indicates the circRNA count, while the vertical axis shows the disease count per degree. (B) Sparsity of some circRNA–disease associations.
Figure 4
Figure 4
Overview of the knowledge graph. It encompasses four types of entities: disease, circRNA, lncRNA, and miRNA, as well as five different types of relationships: miRNA–circRNA, miRNA–lncRNA, disease–miRNA, disease–lncRNA, and circRNA–disease.
Figure 5
Figure 5
Ranking of the expressive power of Sum, Mean, and Max on multisets. The input part represents the aggregated neighborhood network; Sum learns the entire multiset, Mean learns the overall distribution, and Max ignores redundant information.
Figure 6
Figure 6
The principle of graph isomorphism network. (A) Weighted self-loops are added to the nodes to ensure they retain their unique feature information during aggregation. (B) Exchange the neighboring nodes with weighted self-loops. (C) The corresponding parts from (B) are aggregated to update each node’s representation.
Figure 7
Figure 7
Performance comparison of AUC (A) and AUPR (B) on Dataset 1. The compared models include information-propagation-based models (KATZHCDA, RWR, and CD-LNLP), traditional machine learning models (RWR-KNN, ICIRCDA, and RNMFLP), and deep learning models (DMFCDA, GMNN2CD, and KGETCDA).
Figure 8
Figure 8
Performance comparison of AUC (A) and AUPR (B) on Dataset 2. The compared models include information-propagation-based models (KATZHCDA, RWR, and CD-LNLP), traditional machine learning models (RWR-KNN, ICIRCDA, and RNMFLP), and deep learning models (DMFCDA, GMNN2CD, and KGETCDA).
Figure 9
Figure 9
Performance comparison of AUC (A) and AUPR (B) on Dataset 3. The compared models include information-propagation-based models (KATZHCDA, RWR, and CD-LNLP), traditional machine learning models (RWR-KNN, ICIRCDA, and RNMFLP), and deep learning models (DMFCDA, GMNN2CD, and KGETCDA).
Figure 10
Figure 10
Comparison of the average number of accurately identified associations for Dataset 1 (A), Dataset 2 (B), and Dataset 3 (C). The compared models include information-propagation-based models (KATZHCDA, RWR, and CD-LNLP), traditional machine learning models (RWR-KNN, ICIRCDA, and RNMFLP), and deep learning models (DMFCDA, GMNN2CD, and KGETCDA).
Figure 11
Figure 11
Violin plots comparing the performance of different aggregation and prediction modules. (A) AUC values for Dataset 1; (B) AUC values for Dataset 2; (C) AUC values for Dataset 3. It covers GIN+MLP, GIN+SMLP, and GIN+DSP(Ours) based on GIN, GCN+DSP based on GCN, BI-Interaction+DSP based on Bi-Interaction, and GraphSage+DSP based on GraphSage.
Figure 12
Figure 12
Comparison of methods for neighborhood feature calculation. (A,B) represent the AUC and AUPR values on the three datasets, respectively.
Figure 13
Figure 13
Performance comparison of different attention heads and layers. (A) AUC heat map for Dataset 1; (B) AUC heat map for Dataset 2.

References

    1. Gu A., Jaijyan D.K., Yang S.A.-O., Zeng M., Pei S., Zhu H.A.-O. Functions of Circular RNA in Human Diseases and Illnesses. Non-Coding RNA. 2023;9:38. doi: 10.3390/ncrna9040038. - DOI - PMC - PubMed
    1. Xie G., Lei B., Yin Z., Xu F., Liu X. CircMTA2 Drives Gastric Cancer Progression through Suppressing MTA2 Degradation via Interacting with UCHL3. Int. J. Mol. Sci. 2024;25:2817. doi: 10.3390/ijms25052817. - DOI - PMC - PubMed
    1. Kim J.-m., Kim W.R., Park E.G., Lee D.H., Lee Y.J., Shin H.J., Jeong H.-s., Roh H.-Y., Kim H.-S. Exploring the Regulatory Landscape of Dementia: Insights from Non-Coding RNAs. Int. J. Mol. Sci. 2024;25:6190. doi: 10.3390/ijms25116190. - DOI - PMC - PubMed
    1. Wang S., Zhang K., Tan S., Xin J., Yuan Q., Xu H., Xu X., Liang Q., Christiani D.C., Wang M., et al. Circular RNAs in body fluids as cancer biomarkers: The new frontier of liquid biopsies. Mol. Cancer. 2021;20:13. doi: 10.1186/s12943-020-01298-z. - DOI - PMC - PubMed
    1. Wang C.C., Han C.D., Zhao Q., Chen X. Circular RNAs and complex diseases: From experimental results to computational models. Brief. Bioinform. 2021;22:bbab286. doi: 10.1093/bib/bbab286. - DOI - PMC - PubMed

LinkOut - more resources