Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Oct:2024:538-548.
doi: 10.1145/3627673.3679574. Epub 2024 Oct 21.

iMIRACLE: an Iterative Multi-View Graph Neural Network to Model Intercellular Gene Regulation from Spatial Transcriptomic Data

Affiliations

iMIRACLE: an Iterative Multi-View Graph Neural Network to Model Intercellular Gene Regulation from Spatial Transcriptomic Data

Ziheng Duan et al. Proc ACM Int Conf Inf Knowl Manag. 2024 Oct.

Abstract

Spatial transcriptomics has transformed genomic research by measuring spatially resolved gene expressions, allowing us to investigate how cells adapt to their microenvironment via modulating their expressed genes. This essential process usually starts from cell-cell communication (CCC) via ligand-receptor (LR) interaction, leading to regulatory changes within the receiver cell. However, few methods were developed to connect them to provide biological insights into intercellular regulation. To fill this gap, we propose iMiracle, an iterative multi-view graph neural network that models each cell's intercellular regulation with three key features. Firstly, iMiracle integrates inter- and intra-cellular networks to jointly estimate cell-type- and micro-environment-driven gene expressions. Optionally, it allows prior knowledge of intra-cellular networks as pre-structured masks to maintain biological relevance. Secondly, iMiracle employs iterative learning to overcome the sparsity of spatial transcriptomic data and gradually fill in the missing edges in the CCC network. Thirdly, iMiracle infers a cell-specific ligand-gene regulatory score based on the contributions of different LR pairs to interpret inter-cellular regulation. We applied iMiracle to nine simulated and eight real datasets from three sequencing platforms and demonstrated that iMiracle consistently outperformed ten methods in gene expression imputation and four methods in regulatory score inference. Lastly, we developed iMiracle as an open-source software and anticipate that it can be a powerful tool in decoding the complexities of inter-cellular transcriptional regulation.

Keywords: cell–ell communications; graph neural networks; inter-cellular gene regulation; spatial transcriptomics.

PubMed Disclaimer

Figures

Figure 7:
Figure 7:
Additional regulatory score inference results.
Figure 1:
Figure 1:
Overview of iMiracle. iMiracle initiates with a sparse cell-by-gene matrix XobsRn×m (n cells and m genes), spatial coordinates CRn×2, and cell type information TRn×t (t cell types). It constructs multi-view cell-cell communication networks GC to model various ligand-receptor interactions. Node embeddings for each cell are generated per view through a graph neural network. A multilayer perceptron then decodes gene expression Xˆs influenced by these LR interactions, integrating knowledge from an established gene regulatory network. iMiracle isolates the baseline gene expression matrix Xˆb solely determined by cell types. The final imputed gene expression matrix Xˆ merges the baseline matrix with expressions from ligand-receptor interactions. Through iterative learning, Xˆ is used to progressively refine the multi-view graph, enhancing both imputation precision and the inference of ligand-to-gene regulatory scores.
Figure 2:
Figure 2:
iMiracle fully delineates the CCC network via iterative learning, uncovering up to 181% more LR interactions in 10x Visium, 67% in Stereoseq, and 153% in SlideseqV2.
Figure 3:
Figure 3:
iMiracle consistently outperforms other methods in the regulatory score inference. (A) In this simulation, 1000 cells are spatially arranged in a 100-unit square. Cell types were determined by their locations, incorporating high-expression zones for various LR pairs, to realistically model gene expression and CCC dynamics. This setup is utilized for inferring cellular-level regulatory scores. (B) Benchmarking results demonstrate iMiracle’s superior accuracy in inferring ligand-gene regulatory scores, surpassing all four baselines across all four metrics.
Figure 4:
Figure 4:
iMiracle reveals substantial regulatory heterogeneity across cells. (A) Detailed ground truth segmentation of the cortical layers and white matter (WM) within the DLPFC section of sample 151507. (B) Visualization of the observed expression pattern of GJA1. (C) Prediction of the baseline expression profile for GJA1. (D) Prediction of the LR-regulated expression for GJA1. (E) Identification of three key regions within sample 151507. (F-H) Top three LR interactions and their corresponding regulatory scores. LR pairs PTN-SDC4 and LRRC4B-PTPRD were discovered through an iterative learning approach, indicated by their red colors. (I) Heatmap illustration of the percentage of cells featuring the top five LR pairs in each identified region. (J-L) Jaccard similarity of the top five LR pairs for cells within each region, revealing substantial regulatory heterogeneity across cells.
Figure 5:
Figure 5:
Variant analysis. (A) RMSE w.r.t. different variants of iMiracle for gene imputation. (B) Four regulatory score inference metrics w.r.t. variants of iMiracle. (C) Four regulatory score inference metrics w.r.t. GNN architectures.
Figure 6:
Figure 6:
Parameter analysis. (A) Precision w.r.t. different blending coefficient α. (B) Precision w.r.t. different hidden dimension d. (B) Precision w.r.t. different number of neighbors k for graph construction. (B) Precision w.r.t. GNN layers L.

Similar articles

References

    1. Benotmane Jasim Kada, Kueckelhaus Jan, Will Paulina, Zhang Junyi, Ravi Vidhya M, Joseph Kevin, Sankowski Roman, Beck Jürgen, Lee-Chang Catalina, Schnell Oliver, et al. 2023. High-sensitive spatially resolved T cell receptor sequencing with SPTCR-seq. Nature communications 14, 1 (2023), 7432. - PMC - PubMed
    1. Biancalani Tommaso, Scalia Gabriele, Buffoni Lorenzo, Avasthi Raghav, Lu Ziqing, Sanger Aman, Tokcan Neriman, Vanderburg Charles R., Segerstolpe Åsa, Zhang Meng, Avraham-Davidi Inbal, Vickovic Sanja, Nitzan Mor, Ma Sai, Subramanian Ayshwarya, Lipinski Michal, Buenrostro Jason, Brown Nik Bear, Fanelli Duccio, Zhuang Xiaowei, Macosko Evan Z., and Regev Aviv. 2021. Deep learning and alignment of spatially resolved single-cell transcriptomes with Tangram. Nature Methods 18, 11 (01 Nov 2021), 1352–1362. 10.1038/s41592-021-01264-7 - DOI - PMC - PubMed
    1. Browaeys Robin, Saelens Wouter, and Saeys Yvan. 2020. NicheNet: modeling intercellular communication by linking ligands to target genes. Nature methods 17, 2 (2020), 159–162. - PubMed
    1. Cao Yingxin, Fu Laiyi, Wu Jie, Peng Qinke, Nie Qing, Zhang Jing, and Xie Xiaohui. 2021. SAILER: scalable and accurate invariant representation learning for single-cell ATAC-seq processing and integration. Bioinformatics 37, Supplement_1 (2021), i317–i326. - PMC - PubMed
    1. Chen Ao, Liao Sha, Cheng Mengnan, Ma Kailong, Wu Liang, Lai Yiwei, Yang Jin, Li Wenjiao, Xu Jiangshan, Hao Shijie, et al. 2021. Large field of view-spatially resolved transcriptomics at nanoscale resolution. BioRxiv 2021 (2021).