Multimodal Data Fusion for Whole-Slide Histopathology Image Classification
- PMID: 41230238
- PMCID: PMC12602746
- DOI: 10.1007/s41666-025-00212-w
Multimodal Data Fusion for Whole-Slide Histopathology Image Classification
Abstract
Whole slide images (WSIs) are critical for cancer diagnosis but pose computational challenges due to their gigapixel resolution. While automated AI tools can accelerate diagnostic workflows, they often rely on precise annotations and require substantial training data. Integrating multimodal data-such as WSIs and corresponding pathology reports-offers a promising solution to improve classification accuracy and reduce diagnostic variability. In this study, we introduce MPath-Net, an end-to-end multimodal framework that combines WSIs and pathology reports for enhanced cancer subtype classification. Using the TCGA dataset (1684 cases: 916 kidney, 768 lung), we applied multiple-instance learning (MIL) for WSI feature extraction and Sentence-BERT for report encoding, followed by joint fine-tuning for tumor classification. MPath-Net achieved 94.65% accuracy, 0.9553 precision, 0.9472 recall, and 0.9473 F1-score, significantly outperforming baseline models (P < 0.05). In addition, attention heatmaps provided interpretable tumor tissue localization, demonstrating the clinical utility of our approach. These findings suggest that MPath-Net can support pathologists by improving diagnostic accuracy, reducing inter-reader variability, and advancing precision medicine through multimodal AI integration.
Keywords: Classification; Clinical report; Multimodal learning; Pathology; Whole-slide image.
© The Author(s) 2025.
Conflict of interest statement
Competing interestsThe authors declare no competing interests.
Figures
References
-
- Silva LAV, Rohr K (2020) Pan-cancer prognosis prediction using multimodal deep learning. In: 2020 IEEE 17th International Symposium on Biomedical Imaging (ISBI). IEEE, pp 568–571
-
- Li S, Shi H, Sui D, Hao A and Qin H (2020) A novel pathological images and genomic data fusion framework for breast cancer survival prediction. 2020 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC). IEEE, p. 1384–7. - PubMed