Natural Language Processing Versus Content-Based Image Analysis for Medical Document Retrieval

Aurélie Névéol¹, Thomas M Deserno, Stéfan J Darmoni, Mark Oliver Güld, Alan R Aronson

Affiliations

PMID: 19633735
PMCID: PMC2714909
DOI: 10.1002/asi.20955

Natural Language Processing Versus Content-Based Image Analysis for Medical Document Retrieval

Aurélie Névéol et al. J Am Soc Inf Sci Technol. 2008.

. 2008 Sep 18;60(1):123-134.

doi: 10.1002/asi.20955.

Authors

Aurélie Névéol¹, Thomas M Deserno, Stéfan J Darmoni, Mark Oliver Güld, Alan R Aronson

Affiliation

¹ U.S. National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD 20894. E-mail: neveola@nlm.nih.gov.

PMID: 19633735
PMCID: PMC2714909
DOI: 10.1002/asi.20955

Abstract

One of the most significant recent advances in health information systems has been the shift from paper to electronic documents. While research on automatic text and image processing has taken separate paths, there is a growing need for joint efforts, particularly for electronic health records and biomedical literature databases. This work aims at comparing text-based versus image-based access to multimodal medical documents using state-of-the-art methods of processing text and image components. A collection of 180 medical documents containing an image accompanied by a short text describing it was divided into training and test sets. Content-based image analysis and natural language processing techniques are applied individually and combined for multimodal document analysis. The evaluation consists of an indexing task and a retrieval task based on the "gold standard" codes manually assigned to corpus documents. The performance of text-based and image-based access, as well as combined document features, is compared. Image analysis proves more adequate for both the indexing and retrieval of the images. In the indexing task, multimodal analysis outperforms both independent image and text analysis. This experiment shows that text describing images can be usefully analyzed in the framework of a hybrid text/image retrieval system.

PubMed Disclaimer

Figures

**Figure 1**
Sample document in the test corpus. An English translation of the text is provided by the authors for illustration purposes: Caption: “François, 1 1/2 month: Abdomen without preparation.” Paragraph: “Based on the child’s age and the fact that he experienced projectile vomiting, a diagnosis of pyloric stenosis can be made. An emergency abdomen without preparation is ordered for François; no evidence of gastric dilatation is visible.”

**Figure 2**
Processing multimodal biomedical documents for information retrieval.

**Figure 3**
Set of images in the test corpus matching the query “1121-115-700-400”.

**Figure 4**
Text and image analysis performed to assign IRMA codes to a test document. Text analysis (solid line and box) involved applying a dictionary and using the training collection to retrieve the 5 nearest neighbors (5-NN) for the text portion of a query document. The image analysis (dashed lines and boxes) used either the training collection or the IRMA database to retrieve the k nearest neighbors (k-NN) based on content-based image retrieval (CBIR) features.

**Figure 5**
Sample IRMA-MeSH equivalences made into dictionary entries (using MeSH hierarchy).

**Figure 6**
Sample IRMA-MeSH equivalences made into dictionary entries (adapting to IRMA specificity).

See this image and copyright information in PMC

References

1. Aachen University of Technology Image retrieval in medical applications. 2008. Retrieved March 28, 2008, from http://www.irma-project.org/index_en.php.
1. Aronson AR, Bodenreider O, Demner-Fushman D, Fung KW, Lee VK, Mork JG, et al. From indexing the biomedical literature to coding clinical text : Experience with MTI and machine learning approaches. In: Cohen K, Demner-Fushman D, Friedman C, Hirschman L, Pestian J, editors. Proceedings of the ACL 2007 Workshop on Biological, Translational, and Clinical Language Processing (BioNLP); 2007. pp. 105–112.
1. Boujemaa N, Fauqueur J, Gouet V. IAPR International Conference on Image and Signal Processing (ICISP’ 2003), Agadir, Morocco. What's beyond query by example? 2003. Jun, Retrieved August 29, 2008, from http://www-rocq.inria.fr/~gouet/Recherche/Papiers/icisp03/pdf.
1. Briet S. Qu’est-ce que la documentation? [What is documentation?] Paris: Éditions Documentaires et Industrielles; 1951. [English translation retrieved April 29, 2007, from http://martinetl.free.fr/suzanne_briet.htm and archived at http://www.webcitation.org/5OUOoaTjW]
1. Byrne K, Klein E. Image retrieval using natural language and content-based techniques. In: de Vries AP, editor. Proceedings of the Fourth Dutch-Belgian Information Retrieval Workshop (DIR 2003); Amsterdam: Institute for Logic, Language, and Computation; 2003. pp. 57–62.

Grants and funding

Z99 LM999999/ImNIH/Intramural NIH HHS/United States

LinkOut - more resources

Full Text Sources
- Europe PubMed Central
- PubMed Central

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Natural Language Processing Versus Content-Based Image Analysis for Medical Document Retrieval

Affiliation

Natural Language Processing Versus Content-Based Image Analysis for Medical Document Retrieval

Authors

Affiliation

Abstract

Figures

References

Grants and funding

LinkOut - more resources

Full Text Sources