This is a preprint.
DF-DM: A foundational process model for multimodal data fusion in the artificial intelligence era
- PMID: 38746100
- PMCID: PMC11092829
- DOI: 10.21203/rs.3.rs-4277992/v1
DF-DM: A foundational process model for multimodal data fusion in the artificial intelligence era
Abstract
In the big data era, integrating diverse data modalities poses significant challenges, particularly in complex fields like healthcare. This paper introduces a new process model for multimodal Data Fusion for Data Mining, integrating embeddings and the Cross-Industry Standard Process for Data Mining with the existing Data Fusion Information Group model. Our model aims to decrease computational costs, complexity, and bias while improving efficiency and reliability. We also propose "disentangled dense fusion," a novel embedding fusion method designed to optimize mutual information and facilitate dense inter-modality feature interaction, thereby minimizing redundant information. We demonstrate the model's efficacy through three use cases: predicting diabetic retinopathy using retinal images and patient metadata, domestic violence prediction employing satellite imagery, internet, and census data, and identifying clinical and demographic features from radiography images and clinical notes. The model achieved a Macro F1 score of 0.92 in diabetic retinopathy prediction, an R-squared of 0.854 and sMAPE of 24.868 in domestic violence prediction, and a macro AUC of 0.92 and 0.99 for disease prediction and sex classification, respectively, in radiological analysis. These results underscore the Data Fusion for Data Mining model's potential to significantly impact multimodal data processing, promoting its adoption in diverse, resource-constrained settings.
Keywords: Data Fusion; Embeddings; Foundational Models; Multimodal Data.
Conflict of interest statement
Additional Declarations: No competing interests reported.
Figures






Similar articles
-
An End-to-End Natural Language Processing Application for Prediction of Medical Case Coding Complexity: Algorithm Development and Validation.JMIR Med Inform. 2023 Jan 19;11:e38150. doi: 10.2196/38150. JMIR Med Inform. 2023. PMID: 36656627 Free PMC article.
-
HFBSurv: hierarchical multimodal fusion with factorized bilinear models for cancer survival prediction.Bioinformatics. 2022 Apr 28;38(9):2587-2594. doi: 10.1093/bioinformatics/btac113. Bioinformatics. 2022. PMID: 35188177 Free PMC article.
-
Multimodal multi-instance evidence fusion neural networks for cancer survival prediction.Sci Rep. 2025 Mar 26;15(1):10470. doi: 10.1038/s41598-025-93770-3. Sci Rep. 2025. PMID: 40140434 Free PMC article.
-
Diabetic retinopathy screening through artificial intelligence algorithms: A systematic review.Surv Ophthalmol. 2024 Sep-Oct;69(5):707-721. doi: 10.1016/j.survophthal.2024.05.008. Epub 2024 Jun 15. Surv Ophthalmol. 2024. PMID: 38885761
-
A multimodal Parkinson quantification by fusing eye and gait motion patterns, using covariance descriptors, from non-invasive computer vision.Comput Methods Programs Biomed. 2022 Mar;215:106607. doi: 10.1016/j.cmpb.2021.106607. Epub 2021 Dec 30. Comput Methods Programs Biomed. 2022. PMID: 34998167 Review.
References
-
- Goodwin P.: Tape and cloud: Solving storage problems in the zettabyte era o f data
-
- Pan I., Mason L.R., Matar O.K.: Data-centric engineering: integrating simulation, machine learning and statistics. challenges and opportunities 249, 117271 10.1016/j.ces.2021.117271 - DOI
-
- Furman J., Seamans R.: AI and the economy 19, 161–191 10.1086/699936. eprint: - DOI
-
- Shaik T., Tao X., Li L., Xie H., Velásquez J.D.: A survey of multimodal information fusion for smart healthcare: Mapping the journey from data to wisdom 102, 102040 10.1016/j.inffus.2023.102040 - DOI
-
- Ma D., Dang B., Li S., Zang H., Dong X.: Implementation of computer vision technology based on artificial intelligence for medical image analysis. International Journal of Computer Science and Information Technology 1(1), 69–76 (2023)
Publication types
Grants and funding
LinkOut - more resources
Full Text Sources
Miscellaneous