HE2Gene: image-to-RNA translation via multi-task learning for spatial transcriptomics data
- PMID: 38837395
- PMCID: PMC11164830
- DOI: 10.1093/bioinformatics/btae343
HE2Gene: image-to-RNA translation via multi-task learning for spatial transcriptomics data
Abstract
Motivation: Tissue context and molecular profiling are commonly used measures in understanding normal development and disease pathology. In recent years, the development of spatial molecular profiling technologies (e.g. spatial resolved transcriptomics) has enabled the exploration of quantitative links between tissue morphology and gene expression. However, these technologies remain expensive and time-consuming, with subsequent analyses necessitating high-throughput pathological annotations. On the other hand, existing computational tools are limited to predicting only a few dozen to several hundred genes, and the majority of the methods are designed for bulk RNA-seq.
Results: In this context, we propose HE2Gene, the first multi-task learning-based method capable of predicting tens of thousands of spot-level gene expressions along with pathological annotations from H&E-stained images. Experimental results demonstrate that HE2Gene is comparable to state-of-the-art methods and generalizes well on an external dataset without the need for re-training. Moreover, HE2Gene preserves the annotated spatial domains and has the potential to identify biomarkers. This capability facilitates cancer diagnosis and broadens its applicability to investigate gene-disease associations.
Availability and implementation: The source code and data information has been deposited at https://github.com/Microbiods/HE2Gene.
© The Author(s) 2024. Published by Oxford University Press.
Conflict of interest statement
No competing interest is declared.
Figures



Similar articles
-
Integrating spatial transcriptomics and bulk RNA-seq: predicting gene expression with enhanced resolution through graph attention networks.Brief Bioinform. 2024 May 23;25(4):bbae316. doi: 10.1093/bib/bbae316. Brief Bioinform. 2024. PMID: 38960406 Free PMC article.
-
Multimodal contrastive learning for spatial gene expression prediction using histology images.Brief Bioinform. 2024 Sep 23;25(6):bbae551. doi: 10.1093/bib/bbae551. Brief Bioinform. 2024. PMID: 39471412 Free PMC article.
-
A multi-modality and multi-granularity collaborative learning framework for identifying spatial domains and spatially variable genes.Bioinformatics. 2024 Oct 1;40(10):btae607. doi: 10.1093/bioinformatics/btae607. Bioinformatics. 2024. PMID: 39418177 Free PMC article.
-
Advances in spatial transcriptomics and its applications in cancer research.Mol Cancer. 2024 Jun 20;23(1):129. doi: 10.1186/s12943-024-02040-9. Mol Cancer. 2024. PMID: 38902727 Free PMC article. Review.
-
Statistical Power Analysis for Designing Bulk, Single-Cell, and Spatial Transcriptomics Experiments: Review, Tutorial, and Perspectives.Biomolecules. 2023 Jan 24;13(2):221. doi: 10.3390/biom13020221. Biomolecules. 2023. PMID: 36830591 Free PMC article. Review.
Cited by
-
ResSAT: Enhancing Spatial Transcriptomics Prediction from H&E- Stained Histology Images with Interactive Spot Transformer.Res Sq [Preprint]. 2024 Aug 9:rs.3.rs-4707959. doi: 10.21203/rs.3.rs-4707959/v1. Res Sq. 2024. PMID: 39149477 Free PMC article. Preprint.
References
-
- Andersson A, Larsson L, Stenbeck L. et al. Spatial deconvolution of HER2-positive breast tumors reveals novel intercellular relationships. bioRxiv, 2020, 2020-07. doi: 10.1101/2020.07.14.200600 - DOI
-
- Duran-Lopez L, Dominguez-Morales JP, Conde-Martin AF. et al. Prometeo: a CNN-based computer-aided diagnosis system for WSI prostate cancer detection. IEEE Access 2020;8:128613–28.