Stud Health Technol Inform. 2004;107(Pt 1):292-6.

Text mining functional keywords associated with genes

  • PMID: 15360821

Ying Liu et al. Stud Health Technol Inform. 2004.

Abstract

Modern experimental techniques can gather vast amounts of biological data in a single experiment (e.g., a DNA microarray experiment), which makes it extremely difficult for researchers to interpret the data and draw conclusions about gene function. Current approaches provide useful information that organizes or relates genes, but a major shortcoming is that they either do not address the specific functions of genes or are constrained by functions predefined in other databases, which can be biased, incomplete, or out of date. We extended Andrade and Valencia's method [1] to statistically mine functional keywords associated with genes from MEDLINE abstracts. The abstracts are analyzed statistically to score and rank keywords for each gene, using a background set of words to supply baseline frequencies. For the genes we tested, the method generally yielded very good functional keyword information, which we confirmed by searching for the individual keywords in context. The extracted keywords reveal a wealth of potential functional concepts that were not represented in existing public databases. We believe this approach is general enough to apply to the medical and biological literature to find other relationships, such as drugs vs. genes or risk factors vs. genes.
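
The abstract does not state which statistic is used for scoring, so the following is only a minimal sketch of one plausible reading: each word's frequency in a gene's abstracts is compared with its mean and standard deviation across several background sets, giving a z-score used for ranking. The function names, the `min_sd` floor, and the toy sentences are all assumptions made for illustration; they are not the authors' implementation or data.

```python
import math
import re
from collections import Counter


def doc_frequency(abstracts):
    """Fraction of abstracts in which each word occurs at least once."""
    counts = Counter()
    for text in abstracts:
        counts.update(set(re.findall(r"[a-z]+", text.lower())))
    n = len(abstracts)
    return {word: c / n for word, c in counts.items()}


def rank_keywords(gene_abstracts, background_sets, min_sd=1e-3):
    """Rank words for one gene by z-score against background sets.

    min_sd is a hypothetical floor that avoids division by zero when a
    word has identical frequency in every background set.
    """
    gene_freq = doc_frequency(gene_abstracts)
    bg_freqs = [doc_frequency(s) for s in background_sets]
    scores = {}
    for word, f in gene_freq.items():
        values = [bf.get(word, 0.0) for bf in bg_freqs]
        mean = sum(values) / len(values)
        var = sum((v - mean) ** 2 for v in values) / len(values)
        sd = max(math.sqrt(var), min_sd)
        scores[word] = (f - mean) / sd
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Invented toy sentences standing in for MEDLINE abstracts.
gene_abstracts = [
    "the protein regulates apoptosis in cells",
    "apoptosis is induced by the protein",
]
background_sets = [
    ["the protein binds dna in cells", "the cells express the protein"],
    ["the enzyme is found in cells", "the protein is stable"],
]

ranked = rank_keywords(gene_abstracts, background_sets)
for word, score in ranked[:3]:
    print(word, round(score, 1))
```

In this sketch, words that are common everywhere (such as "the") score near zero, while words concentrated in the gene's own abstracts rise to the top. A real pipeline would also need stop-word handling, stemming, and far larger background sets.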
