ParlaMint, a CLARIN flagship project, resulted in the creation of comparable corpora of parliamentary debates of 29 European countries and autonomous regions, covering at least the period from 2015 to 2022, and containing over 1 billion words. The corpora are uniformly encoded, contain rich metadata about their 24 thousand speakers, and are linguistically annotated up to the level of Universal Dependencies syntax and named entities.

The role of the ACDH-CH was to provide and encode the Austrian data for the ParlaMint project. The Austrian corpus, ParlaMint-AT, contains data from 1996 to 2022. It can be downloaded from https://www.clarin.si/repository/xmlui/handle/11356/1859 or consulted online via the concordance tools of the CLARIN.si infrastructure.


Publications

  • Wissik, Tanja, and Hannes Pirker. 2018. ParlAT beta Corpus of Austrian Parliamentary Records. In Darja Fišer, Maria Eskevich, and Franciska de Jong (eds.), Proceedings of the LREC2018 Workshop ParlaCLARIN: Creating and Using Parliamentary Corpora, Eleventh International Conference on Language Resources and Evaluation (LREC2018). Miyazaki: European Language Resources Association.

Project lead

Tanja Wissik

 

Contact (ACDH-CH)

Tanja Wissik

Hannes Pirker

 

Funding

CLARIN

 

Project duration

01/2022–09/2023

 

Links

ParlaMint website