Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2001:17 Suppl 1:S97-106.
doi: 10.1093/bioinformatics/17.suppl_1.s97.

Disambiguating proteins, genes, and RNA in text: a machine learning approach

Affiliations
Comparative Study

Disambiguating proteins, genes, and RNA in text: a machine learning approach

V Hatzivassiloglou et al. Bioinformatics. 2001.

Abstract

We present an automated system for assigning protein, gene, or mRNA class labels to biological terms in free text. Three machine learning algorithms and several extended ways for defining contextual features for disambiguation are examined, and a fully unsupervised manner for obtaining training examples is proposed. We train and evaluate our system over a collection of 9 million words of molecular biology journal articles, obtaining accuracy rates up to 85%.

PubMed Disclaimer

Publication types