BMC Bioinformatics. 2015;16 Suppl 10(Suppl 10):S2.
doi: 10.1186/1471-2105-16-S10-S2. Epub 2015 Jul 13.

Overview of the Cancer Genetics and Pathway Curation tasks of BioNLP Shared Task 2013

Sampo Pyysalo et al. BMC Bioinformatics. 2015.

Abstract

Background: Since their introduction in 2009, the BioNLP Shared Task events have been instrumental in advancing the development of methods and resources for the automatic extraction of information from the biomedical literature. In this paper, we present the Cancer Genetics (CG) and Pathway Curation (PC) tasks, two event extraction tasks introduced in the BioNLP Shared Task 2013. The CG task focuses on cancer, emphasizing the extraction of physiological and pathological processes at various levels of biological organization, and the PC task targets reactions relevant to the development of biomolecular pathway models, defining its extraction targets on the basis of established pathway representations and ontologies.

Results: Six groups participated in the CG task and two groups in the PC task, together applying a wide range of extraction approaches including both established state-of-the-art systems and newly introduced extraction methods. The best-performing systems achieved F-scores of 55% on the CG task and 53% on the PC task, demonstrating a level of performance comparable to the best results achieved in similar previously proposed tasks.
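The F-scores reported above combine precision and recall into a single number. As a minimal sketch, assuming the balanced F1 measure (the harmonic mean of precision and recall) and using made-up counts rather than any figures from the shared task, the computation looks like this. The function names `f_score` and `precision_recall_f` are illustrative only.

```python
def f_score(precision: float, recall: float, beta: float = 1.0) -> float:
    """Return the F-beta score; beta=1 gives the balanced harmonic mean (F1)."""
    if precision <= 0.0 or recall <= 0.0:
        return 0.0
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


def precision_recall_f(tp: int, fp: int, fn: int):
    """Compute (precision, recall, F1) from true/false positive and false negative counts."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall, f_score(precision, recall)


# Hypothetical counts, not taken from the task results.
p, r, f = precision_recall_f(tp=60, fp=40, fn=60)
print(f"precision={p:.2f} recall={r:.2f} F1={f:.3f}")
```

Because F1 is a harmonic mean, it is pulled toward the lower of the two values, so a system cannot reach a high score by maximizing precision or recall alone.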

Conclusions: The results indicate that existing event extraction technology can generalize to meet the novel challenges represented by the CG and PC task settings, suggesting that extraction methods are capable of supporting the construction of knowledge bases on the molecular mechanisms of cancer and the curation of biomolecular pathway models. The CG and PC tasks continue as open challenges for all interested parties, with data, tools and resources available from the shared task homepage.

Figures

Figure 1
Illustration of entity annotations. a) Cancer Genetics task b) Pathway Curation task. (Illustrations created with BRAT [59].)
Figure 2
Illustration of relation annotations. a) Cancer Genetics task b) Pathway Curation task.
Figure 3
Illustration of event annotations. a) Cancer Genetics task b) Pathway Curation task.
Figure 4
Illustration of the data format. Adapted from [36].
Figure 5
Pathway model reactions and event representations. Illustration of reactions in a pathway model (left), idealized explicit statements annotated with a directly mapped representation (center), and realistic expressions in text with actual event annotation (right). Figure from [5].
Figure 6
Simple events. Events with single arguments are reliably extracted regardless of factors such as text domain or level of biological organization.
Figure 7
Complex events. Events involving multiple participants, recursive structure, and modifications continue to represent challenges for extraction.
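To make the entity and event annotations described in the captions above more concrete, the following is a rough sketch of reading annotations in a tab-separated standoff style. It assumes the BioNLP Shared Task convention of `T` lines for text-bound annotations and `E` lines for events. The sample sentence, the type names, and the helper `parse_standoff` are illustrative assumptions, not data or code from the task, and discontinuous spans, relations, and modifications are not handled.

```python
from dataclasses import dataclass, field


@dataclass
class TextBound:
    id: str
    type: str
    start: int
    end: int
    text: str


@dataclass
class Event:
    id: str
    type: str
    trigger: str                      # ID of the trigger text-bound annotation
    args: list = field(default_factory=list)  # (role, target ID) pairs


def parse_standoff(lines):
    """Parse T (text-bound) and E (event) lines; other line types are skipped."""
    entities, events = {}, {}
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            continue
        ann_id, body = line.split("\t", 1)
        if ann_id.startswith("T"):
            head, text = body.split("\t", 1)
            type_, start, end = head.split(" ")
            entities[ann_id] = TextBound(ann_id, type_, int(start), int(end), text)
        elif ann_id.startswith("E"):
            parts = body.split()
            type_, trigger = parts[0].split(":", 1)
            args = [tuple(p.split(":", 1)) for p in parts[1:]]
            events[ann_id] = Event(ann_id, type_, trigger, args)
    return entities, events


# Hypothetical annotation of the sentence "p53 induces apoptosis".
SAMPLE = [
    "T1\tGene_or_gene_product 0 3\tp53",
    "T2\tPositive_regulation 4 11\tinduces",
    "T3\tCell_death 12 21\tapoptosis",
    "E1\tCell_death:T3",
    "E2\tPositive_regulation:T2 Theme:E1 Cause:T1",
]

entities, events = parse_standoff(SAMPLE)
for ev in events.values():
    print(ev.id, ev.type, entities[ev.trigger].text, ev.args)
```

In this sample, `E2` takes another event (`E1`) as its Theme, which is the kind of recursive structure that the Figure 7 caption identifies as a continuing challenge for extraction.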

References

    1. Nédellec C, Bossy R, Kim JD, Kim JJ, Ohta T, Pyysalo S, Zweigenbaum P. Overview of BioNLP Shared Task 2013. Proceedings of the BioNLP Shared Task 2013 Workshop. 2013. pp. 1–7.
    2. Bossy R, Golik W, Ratkovic Z, Valsamou D, Bessières P, Nédellec C. An Overview of the Gene Regulation Network and the Bacteria Biotope Tasks in BioNLP'13 Shared Task. BMC Bioinformatics. 2014;16(Suppl 10).
    3. Kim JD, Kim JJ, Han X, Rebholz-Schuhmann D. Extending the evaluation of Genia Event task toward knowledge base construction and comparison to Gene Regulation Ontology task. BMC Bioinformatics. 2014.
    4. Pyysalo S, Ohta T, Ananiadou S. Overview of the Cancer Genetics (CG) task of BioNLP Shared Task 2013. Proceedings of the BioNLP Shared Task 2013 Workshop. 2013. pp. 58–66.
    5. Ohta T, Pyysalo S, Rak R, Rowley A, Chun HW, Jung SJ, et al. Overview of the pathway curation (PC) task of BioNLP Shared Task 2013. Proceedings of the BioNLP Shared Task 2013 Workshop. 2013. pp. 67–75.
