Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Jul 1;8(2-3):151-154.
doi: 10.1016/j.websem.2010.04.002.

Structured Literature Image Finder: Parsing Text and Figures in Biomedical Literature

Affiliations

Structured Literature Image Finder: Parsing Text and Figures in Biomedical Literature

Amr Ahmed et al. Web Semant. .

Abstract

The SLIF project combines text-mining and image processing to extract structured information from biomedical literature. SLIF extracts images and their captions from published papers. The captions are automatically parsed for relevant biological entities (protein and cell type names), while the images are classified according to their type (e.g., micrograph or gel). Fluorescence microscopy images are further processed and classified according to the depicted subcellular localization. The results of this process can be queried online using either a user-friendly web-interface or an XML-based web-service. As an alternative to the targeted query paradigm, SLIF also supports browsing the collection based on latent topic models which are derived from both the annotated text and the image data. The SLIF web application, as well as labeled datasets used for training system components, is publicly available at http://slif.cbi.cmu.edu.

PubMed Disclaimer

Figures

Figure 1
Figure 1
SLIF Pipeline. This figure shows the paper processing pipeline.

Similar articles

Cited by

References

    1. Murphy RF, Velliste M, Yao J, Porreca G. Searching online journals for fluorescence microscope images depicting protein subcellular location patterns. BIBE '01: Proceedings of the 2nd IEEE International Symposium on Bioinformatics and Bioengineering, IEEE Computer Society; Washington, DC, USA. 2001. pp. 119–128.
    1. Cohen WW, Wang R, Murphy RF. Understanding captions in biomedical publications. KDD '03: Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining; ACM, New York, NY, USA. 2003. pp. 499–504.
    1. Murphy RF, Kou Z, Hua J, Joffe M, Cohen WW. Extracting and structuring subcellular location information from on-line journal articles: The subcellular location image finder. Proceedings of the IASTED International Conference on Knowledge Sharing and Collaborative Engineering; 2004. pp. 109–114.
    1. Kou Z, Cohen WW, Murphy RF. A stacked graphical model for associating sub-images with sub-captions. Proceeding of Pacific Symposium on Biocomputing World Scientific. 2007:257–268. - PMC - PubMed
    1. Kou Z, Cohen WW, Murphy RF. Extracting information from text and images for location proteomics. In: Zaki MJ, Wang JTL, Toivonen H, editors. Proceedings of BIOKDD; 2003. pp. 2–9.

LinkOut - more resources