Natural Language Processing Versus Content-Based Image Analysis for Medical Document Retrieval
- PMID: 19633735
- PMCID: PMC2714909
- DOI: 10.1002/asi.20955
Natural Language Processing Versus Content-Based Image Analysis for Medical Document Retrieval
Abstract
One of the most significant recent advances in health information systems has been the shift from paper to electronic documents. While research on automatic text and image processing has taken separate paths, there is a growing need for joint efforts, particularly for electronic health records and biomedical literature databases. This work aims at comparing text-based versus image-based access to multimodal medical documents using state-of-the-art methods of processing text and image components. A collection of 180 medical documents containing an image accompanied by a short text describing it was divided into training and test sets. Content-based image analysis and natural language processing techniques are applied individually and combined for multimodal document analysis. The evaluation consists of an indexing task and a retrieval task based on the "gold standard" codes manually assigned to corpus documents. The performance of text-based and image-based access, as well as combined document features, is compared. Image analysis proves more adequate for both the indexing and retrieval of the images. In the indexing task, multimodal analysis outperforms both independent image and text analysis. This experiment shows that text describing images can be usefully analyzed in the framework of a hybrid text/image retrieval system.
Figures






Similar articles
-
A framework for biomedical figure segmentation towards image-based document retrieval.BMC Syst Biol. 2013;7 Suppl 4(Suppl 4):S8. doi: 10.1186/1752-0509-7-S4-S8. Epub 2013 Oct 23. BMC Syst Biol. 2013. PMID: 24565394 Free PMC article.
-
Text-based multi-dimensional medical images retrieval according to the features-usage correlation.Med Biol Eng Comput. 2021 Oct;59(10):1993-2017. doi: 10.1007/s11517-021-02392-0. Epub 2021 Aug 20. Med Biol Eng Comput. 2021. PMID: 34415513 Free PMC article.
-
Enhanced information retrieval from narrative German-language clinical text documents using automated document classification.Stud Health Technol Inform. 2008;136:473-8. Stud Health Technol Inform. 2008. PMID: 18487776
-
A review of content-based image retrieval systems in medical applications-clinical benefits and future directions.Int J Med Inform. 2004 Feb;73(1):1-23. doi: 10.1016/j.ijmedinf.2003.11.024. Int J Med Inform. 2004. PMID: 15036075 Review.
-
Evaluating performance of biomedical image retrieval systems--an overview of the medical image retrieval task at ImageCLEF 2004-2013.Comput Med Imaging Graph. 2015 Jan;39:55-61. doi: 10.1016/j.compmedimag.2014.03.004. Epub 2014 Mar 27. Comput Med Imaging Graph. 2015. PMID: 24746250 Free PMC article. Review.
Cited by
-
Content-Based Image Retrieval of Chest CT with Convolutional Neural Network for Diffuse Interstitial Lung Disease: Performance Assessment in Three Major Idiopathic Interstitial Pneumonias.Korean J Radiol. 2021 Feb;22(2):281-290. doi: 10.3348/kjr.2020.0603. Epub 2020 Oct 21. Korean J Radiol. 2021. PMID: 33169547 Free PMC article.
-
Medical Image Retrieval: A Multimodal Approach.Cancer Inform. 2015 Jul 22;13(Suppl 3):125-36. doi: 10.4137/CIN.S14053. eCollection 2014. Cancer Inform. 2015. PMID: 26309389 Free PMC article.
-
Content-based medical image retrieval: a survey of applications to multidimensional and multimodality data.J Digit Imaging. 2013 Dec;26(6):1025-39. doi: 10.1007/s10278-013-9619-2. J Digit Imaging. 2013. PMID: 23846532 Free PMC article. Review.
-
MR-Class: A Python Tool for Brain MR Image Classification Utilizing One-vs-All DCNNs to Deal with the Open-Set Recognition Problem.Cancers (Basel). 2023 Mar 17;15(6):1820. doi: 10.3390/cancers15061820. Cancers (Basel). 2023. PMID: 36980707 Free PMC article.
-
Towards case-based medical learning in radiological decision making using content-based image retrieval.BMC Med Inform Decis Mak. 2011 Oct 27;11:68. doi: 10.1186/1472-6947-11-68. BMC Med Inform Decis Mak. 2011. PMID: 22032775 Free PMC article.
References
-
- Aachen University of Technology Image retrieval in medical applications. 2008. Retrieved March 28, 2008, from http://www.irma-project.org/index_en.php.
-
- Aronson AR, Bodenreider O, Demner-Fushman D, Fung KW, Lee VK, Mork JG, et al. From indexing the biomedical literature to coding clinical text : Experience with MTI and machine learning approaches. In: Cohen K, Demner-Fushman D, Friedman C, Hirschman L, Pestian J, editors. Proceedings of the ACL 2007 Workshop on Biological, Translational, and Clinical Language Processing (BioNLP); 2007. pp. 105–112.
-
- Boujemaa N, Fauqueur J, Gouet V. IAPR International Conference on Image and Signal Processing (ICISP’ 2003), Agadir, Morocco. What's beyond query by example? 2003. Jun, Retrieved August 29, 2008, from http://www-rocq.inria.fr/~gouet/Recherche/Papiers/icisp03/pdf.
-
- Briet S. Qu’est-ce que la documentation? [What is documentation?] Paris: Éditions Documentaires et Industrielles; 1951. [English translation retrieved April 29, 2007, from http://martinetl.free.fr/suzanne_briet.htm and archived at http://www.webcitation.org/5OUOoaTjW]
-
- Byrne K, Klein E. Image retrieval using natural language and content-based techniques. In: de Vries AP, editor. Proceedings of the Fourth Dutch-Belgian Information Retrieval Workshop (DIR 2003); Amsterdam: Institute for Logic, Language, and Computation; 2003. pp. 57–62.
Grants and funding
LinkOut - more resources
Full Text Sources