J Biosci. 2015 Oct;40(4):671-82.
doi: 10.1007/s12038-015-9552-2.

pubmed.mineR: an R package with text-mining algorithms to analyse PubMed abstracts

Jyoti Rani et al. J Biosci. 2015 Oct.

Abstract

The PubMed literature database is a valuable source of information for scientific research. It is rich in biomedical literature, with more than 24 million citations. Data-mining such a voluminous literature is a challenging task. Although several text-mining algorithms focused on data visualization have been developed in recent years, they have limitations: they are limited in speed, are rigid, and are not available as open source. We have developed an R package, pubmed.mineR, in which we combine the advantages of existing algorithms, overcome their limitations, offer user flexibility, and link with other packages in Bioconductor and the Comprehensive R Archive Network (CRAN), so that users can carry out multifaceted approaches. Three case studies, 'Evolving role of diabetes educators', 'Cancer risk assessment' and 'Dynamic concepts on disease and comorbidity', illustrate the use of pubmed.mineR. The package generally runs fast, with small elapsed times on regular workstations, even with large corpus sizes and compute-intensive functions. pubmed.mineR is available at http://cran.r-project.org/web/packages/pubmed.mineR.
