Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2006 Apr 3:3:2.
doi: 10.1186/1742-5581-3-2.

Conceptual biology, hypothesis discovery, and text mining: Swanson's legacy

Affiliations

Conceptual biology, hypothesis discovery, and text mining: Swanson's legacy

Tanja Bekhuis. Biomed Digit Libr. .

Abstract

Innovative biomedical librarians and information specialists who want to expand their roles as expert searchers need to know about profound changes in biology and parallel trends in text mining. In recent years, conceptual biology has emerged as a complement to empirical biology. This is partly in response to the availability of massive digital resources such as the network of databases for molecular biologists at the National Center for Biotechnology Information. Developments in text mining and hypothesis discovery systems based on the early work of Swanson, a mathematician and information scientist, are coincident with the emergence of conceptual biology. Very little has been written to introduce biomedical digital librarians to these new trends. In this paper, background for data and text mining, as well as for knowledge discovery in databases (KDD) and in text (KDT) is presented, then a brief review of Swanson's ideas, followed by a discussion of recent approaches to hypothesis discovery and testing. 'Testing' in the context of text mining involves partially automated methods for finding evidence in the literature to support hypothetical relationships. Concluding remarks follow regarding (a) the limits of current strategies for evaluation of hypothesis discovery systems and (b) the role of literature-based discovery in concert with empirical research. Report of an informatics-driven literature review for biomarkers of systemic lupus erythematosus is mentioned. Swanson's vision of the hidden value in the literature of science and, by extension, in biomedical digital databases, is still remarkably generative for information scientists, biologists, and physicians.

PubMed Disclaimer

References

    1. Bray D. Reasoning for results. Nature. 2001;412:863. - PubMed
    1. Blagosklonny MV, Pardee AB. Unearthing the gems. Nature. 2002;416:373. - PubMed
    1. Swanson DR. Medical literature as a potential source of new knowledge. Bulletin of the Medical Library Association. 1990;78:29–37. - PMC - PubMed
    1. Theoretical Biology and Medical Modelling http://www.tbiomed.com
    1. NCBI resource guide http://www.ncbi.nlm.nih.gov/Sitemap/ResourceGuide.html

LinkOut - more resources