. 2012 Jun 26;13 Suppl 11(Suppl 11):S1.

doi: 10.1186/1471-2105-13-S11-S1.

The Genia Event and Protein Coreference tasks of the BioNLP Shared Task 2011

Jin-Dong Kim¹, Ngan Nguyen, Yue Wang, Jun'ichi Tsujii, Toshihisa Takagi, Akinori Yonezawa

Affiliations

PMID: 22759455
PMCID: PMC3384256
DOI: 10.1186/1471-2105-13-S11-S1

The Genia Event and Protein Coreference tasks of the BioNLP Shared Task 2011

Jin-Dong Kim et al. BMC Bioinformatics. 2012.

. 2012 Jun 26;13 Suppl 11(Suppl 11):S1.

doi: 10.1186/1471-2105-13-S11-S1.

Authors

Jin-Dong Kim¹, Ngan Nguyen, Yue Wang, Jun'ichi Tsujii, Toshihisa Takagi, Akinori Yonezawa

Affiliation

¹ Database Center for Life Science, Research Organization of Information and Science, 2-11-16 Yayoi, Bunkyo-ku, Tokyo, Japan. jdkim@dbcls.rois.ac.jp

PMID: 22759455
PMCID: PMC3384256
DOI: 10.1186/1471-2105-13-S11-S1

Abstract

Background: The Genia task, when it was introduced in 2009, was the first community-wide effort to address a fine-grained, structural information extraction from biomedical literature. Arranged for the second time as one of the main tasks of BioNLP Shared Task 2011, it aimed to measure the progress of the community since 2009, and to evaluate generalization of the technology to full text papers. The Protein Coreference task was arranged as one of the supporting tasks, motivated from one of the lessons of the 2009 task that the abundance of coreference structures in natural language text hinders further improvement with the Genia task.

Results: The Genia task received final submissions from 15 teams. The results show that the community has made a significant progress, marking 74% of the best F-score in extracting bio-molecular events of simple structure, e.g., gene expressions, and 45% ~ 48% in extracting those of complex structure, e.g., regulations. The Protein Coreference task received 6 final submissions. The results show that the coreference resolution performance in biomedical domain is lagging behind that in newswire domain, cf. 50% vs. 66% in MUC score. Particularly, in terms of protein coreference resolution the best system achieved 34% in F-score.

Conclusions: Detailed analysis performed on the results improves our insight into the problem and suggests the directions for further improvements.

PubMed Disclaimer

Figures

**Figure 1**
**Event annotation example**.

**Figure 2**
**Protein coreference annotation**.

**Figure 3**
**Event distribution in different sections**. The interval of the contour lines is 5%. For example, in the *Methods* and *Caption* sections, 40% of the events are of *Gene*_expression.

See this image and copyright information in PMC

Cited by

A generalizable NLP framework for fast development of pattern-based biomedical relation extraction systems.
Peng Y, Torii M, Wu CH, Vijay-Shanker K. Peng Y, et al. BMC Bioinformatics. 2014 Aug 23;15(1):285. doi: 10.1186/1471-2105-15-285. BMC Bioinformatics. 2014. PMID: 25149151 Free PMC article.
An analysis on the entity annotations in biological corpora.
Neves M. Neves M. F1000Res. 2014 Apr 25;3:96. doi: 10.12688/f1000research.3216.1. eCollection 2014. F1000Res. 2014. PMID: 25254099 Free PMC article. Review.
Annotation and detection of drug effects in text for pharmacovigilance.
Thompson P, Daikou S, Ueno K, Batista-Navarro R, Tsujii J, Ananiadou S. Thompson P, et al. J Cheminform. 2018 Aug 13;10(1):37. doi: 10.1186/s13321-018-0290-y. J Cheminform. 2018. PMID: 30105604 Free PMC article.
BioC: a minimalist approach to interoperability for biomedical text processing.
Comeau DC, Islamaj Doğan R, Ciccarese P, Cohen KB, Krallinger M, Leitner F, Lu Z, Peng Y, Rinaldi F, Torii M, Valencia A, Verspoor K, Wiegers TC, Wu CH, Wilbur WJ. Comeau DC, et al. Database (Oxford). 2013 Sep 18;2013:bat064. doi: 10.1093/database/bat064. Print 2013. Database (Oxford). 2013. PMID: 24048470 Free PMC article.
Broad-coverage biomedical relation extraction with SemRep.
Kilicoglu H, Rosemblat G, Fiszman M, Shin D. Kilicoglu H, et al. BMC Bioinformatics. 2020 May 14;21(1):188. doi: 10.1186/s12859-020-3517-7. BMC Bioinformatics. 2020. PMID: 32410573 Free PMC article.

See all "Cited by" articles

References

1. Kim JD, Ohta T, Pyysalo S, Kano Y, Tsujii J. Overview of BioNLP'09 Shared Task on Event Extraction. Proceedings of Natural Language Processing in Biomedicine (BioNLP) NAACL 2009 Workshop. 2009. pp. 1–9.http://aclweb.org/anthology-new/W/W09/W09-1401.pdf
1. Miwa M, Sætre R, Kim JD, Tsujii J. Event Extraction with Complex Event Classification Using Rich Features. Journal of Bioinformatics and Computational Biology (JBCB) 2010;8:131–146. doi: 10.1142/S0219720010004586. http://www.worldscinet.com/jbcb/08/0801/S0219720010004586.html - DOI - PubMed
1. Poon H, Vanderwende L. Joint Inference for Knowledge Extraction from Biomedical Literature. Proceedings of NAACL-HLT'10. 2010. pp. 813–821.http://aclweb.org/anthology-new/N/N10/N10-1123.pdf
1. Vlachos A. Two Strong Baselines for the BioNLP 2009 Event Extraction Task. Proceedings of BioNLP'10. 2010. pp. 1–9.http://aclweb.org/anthology-new/W/W10/W10-1901.pdf
1. Miwa M, Pyysalo S, Hara T, Tsujii J. A Comparative Study of Syntactic Parsers for Event Extraction. Proceedings of BioNLP'10. 2010. pp. 37–45.http://aclweb.org/anthology-new/W/W10/W10-1905.pdf

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

The Genia Event and Protein Coreference tasks of the BioNLP Shared Task 2011

Affiliation

The Genia Event and Protein Coreference tasks of the BioNLP Shared Task 2011

Authors

Affiliation

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

LinkOut - more resources

Full Text Sources