The ParlaMint corpora of parliamentary proceedings
- PMID: 35125984
- PMCID: PMC8807381
- DOI: 10.1007/s10579-021-09574-0
Abstract
This paper presents the ParlaMint corpora, which contain transcriptions of the sessions of 17 European national parliaments and together comprise half a billion words. The corpora are uniformly encoded, contain rich metadata about 11 thousand speakers, and are linguistically annotated following the Universal Dependencies formalism and with named entities. Samples of the corpora and the conversion scripts are available from the project's GitHub repository. The complete corpora are openly available for download from the CLARIN.SI repository, and for on-line exploration and analysis through the NoSketch Engine and KonText concordancers and the Parlameter interface.
Keywords: Comparable corpora; Parliamentary proceedings; TEI.
© The Author(s) 2022.