Database (Oxford). 2012 Nov 17:2012:bas043.
doi: 10.1093/database/bas043. Print 2012.

Biocuration workflows and text mining: overview of the BioCreative 2012 Workshop Track II

Zhiyong Lu et al. Database (Oxford).

Abstract

Manual curation of data from the biomedical literature is a rate-limiting factor for many expert-curated databases. Despite the continuing advances in biomedical text mining and the pressing needs of biocurators for better tools, few existing text-mining tools have been successfully integrated into production literature curation systems such as those used by the expert-curated databases. To close this gap and better understand all aspects of literature curation, we invited submissions of written descriptions of curation workflows from expert-curated databases for the BioCreative 2012 Workshop Track II. We received seven qualified contributions, primarily from model organism databases. Based on these descriptions, we identified commonalities and differences across the workflows, the common ontologies and controlled vocabularies used and the current and desired uses of text mining for biocuration. Compared to a survey done in 2009, our 2012 results show that many more databases are now using text mining in parts of their curation workflows. In addition, the workshop participants identified text-mining aids for finding gene names and symbols (gene indexing), prioritization of documents for curation (document triage) and ontology concept assignment as those most desired by the biocurators. DATABASE URL: http://www.biocreative.org/tasks/bc-workshop-2012/workflow/.
