PLoS Comput Biol. 2009 Jul;5(7):e1000411. doi: 10.1371/journal.pcbi.1000411. Epub 2009 Jul 31.

Getting started in text mining: part two

Andrey Rzhetsky et al. PLoS Comput Biol. 2009 Jul.
No abstract available

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Figure 1. Major techniques and applications of text mining.
It is common to divide the task of text mining into information retrieval, named-entity recognition, and information extraction. Extracted information can be further used for building systems for answering questions, fusing experimental data with literature-derived information, implementing computational creativity (discovering esoteric connections between facts, matching solutions in one field with open problems in another one, capturing cliques of internally consistent observations that are inconsistent across cliques), and analysis of large-scale dynamics of scientific fields.
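
The three-stage division named in the caption can be illustrated with a minimal sketch. This is not the authors' method: the toy documents, the gene dictionary, the trigger verbs, and every function name below are hypothetical, and each stage is reduced to simple keyword or dictionary matching purely to show how the stages feed into one another.

```python
import re

# Hypothetical toy corpus and lexicons (assumptions, not data from the article).
DOCS = {
    "d1": "BRCA1 interacts with RAD51 during DNA repair.",
    "d2": "The weather was mild in July.",
    "d3": "TP53 regulates MDM2 expression.",
}
GENE_DICTIONARY = {"BRCA1", "RAD51", "TP53", "MDM2"}
TRIGGER_VERBS = {"interacts", "regulates"}


def tokenize(text):
    """Split text into word tokens."""
    return re.findall(r"\w+", text)


def retrieve(docs, query_terms):
    """Information retrieval: keep documents containing any query term."""
    wanted = {term.lower() for term in query_terms}
    return {
        doc_id: text
        for doc_id, text in docs.items()
        if wanted & {tok.lower() for tok in tokenize(text)}
    }


def recognize_entities(tokens, dictionary):
    """Named-entity recognition: dictionary lookup on each token."""
    return [tok for tok in tokens if tok in dictionary]


def extract_relations(text, dictionary, verbs):
    """Information extraction: (entity, verb, entity) around a trigger verb."""
    tokens = tokenize(text)
    relations = []
    for i, tok in enumerate(tokens):
        if tok.lower() in verbs:
            left = recognize_entities(tokens[:i], dictionary)
            right = recognize_entities(tokens[i + 1:], dictionary)
            if left and right:
                # Nearest entity on each side of the trigger verb.
                relations.append((left[-1], tok.lower(), right[0]))
    return relations


def run_pipeline(docs, query_terms, dictionary, verbs):
    """Chain retrieval, entity recognition, and relation extraction."""
    return {
        doc_id: extract_relations(text, dictionary, verbs)
        for doc_id, text in retrieve(docs, query_terms).items()
    }


results = run_pipeline(DOCS, {"repair", "expression"}, GENE_DICTIONARY, TRIGGER_VERBS)
```

Real systems typically replace each stage with something far more robust, such as ranked retrieval, statistical named-entity recognition, and parsing-based extraction; the sketch only shows how the output of one stage becomes the input of the next.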

References

    1. Cohen KB, Hunter L. Getting started in text mining. PLoS Comput Biol. 2008;4:e20. doi: 10.1371/journal.pcbi.0040020.
    2. Hersh W, Hickam D. Information retrieval in medicine: The SAPHIRE experience. Medinfo. 1995;8(Part 2):1433–1437.
    3. Hirschman L, Morgan AA, Yeh AS. Rutabaga by any other name: Extracting biological names. J Biomed Inform. 2002;35:247–259.
    4. Kim JD, Ohta T, Tsujii J. Corpus annotation for mining biomedical events from literature. BMC Bioinformatics. 2008;9:10.
    5. Sasaki Y, Tsuruoka Y, McNaught J, Ananiadou S. How to make the most of NE dictionaries in statistical NER. BMC Bioinformatics. 2008;9(Supplement 11):S5.