Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Nov 13:2010:26-30.

Clinical Case-based Retrieval Using Latent Topic Analysis

Affiliations

Clinical Case-based Retrieval Using Latent Topic Analysis

Corey W Arnold et al. AMIA Annu Symp Proc. .

Abstract

Clinical reporting is often performed with minimal consideration for secondary computational analysis of concepts. This fact makes the comparison of patients challenging as records lack a representation in a space where their similarity may be judged quantitatively. We present a method by which the entirety of a patient's clinical records may be compared using latent topics. To capture topics at a clinically relevant level, patient reports are partitioned based on their type, allowing for a more granular characterization of topics. The resulting probabilistic patient topic representations are directly comparable to one another using distance measures. To navigate a collection of patient records we have developed a workstation that allows users to weight different report types and displays succinct summarizations of why two patients are deemed similar, tailoring and expediting searches. Results show the system is able to capture clinically significant topics that can be used for case-based retrieval.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Graphical model of latent Dirichlet allocation (LDA). Boxes denote replication.
Figure 2.
Figure 2.
Test set log likelihood for LDA models.
Figure 3.
Figure 3.
Topic case-based retrieval workstation. (a) Adjustable weights for reporting domains. (b) Query patient information. (c) Highlighted reports and topics for query patient. (d) Clickable shared topics between patients. (e) Highlighted result reports and topics. (f) Most similar patients listed in descending order for selected query patient.

Similar articles

Cited by

References

    1. Schmidt R, Montani S, Bellazzi R, Portinale L, Gierl L. Cased-based reasoning for medical knowledge-based systems. International Journal of Medical Informatics. 2001;64(2–3):355–67. - PubMed
    1. Aamodt A, Plaza E. Case-based reasoning. Proc MLnet Summer School on Machine Learning and Knowledge Acquisition. 1994:1–58.
    1. Holland E. Glioblastoma multiforme: the terminator. National Acad Sciences. 2000:6242–4. - PMC - PubMed
    1. Blei D, Ng A, Jordan M. Latent Dirichlet allocation. Journal of Machine Learning Research. 2003;3(5):993–1022.
    1. Jordan M, editor. Learning in Graphical Models. Cambridge: MIT Press; 1999.

LinkOut - more resources