PLM2012 Special Session: Corpora and corpus researching tools in Poland
Organizer: Małgorzata Fabiszak
This session will consist of presentations introducing the resources and tools available for the analysis of the Polish language.
Prof. Adam Przepiórkowski will talk about the National Corpus of Polish (Polish acronym: NKJP: Narodowy Korpus Języka Polskiego), which has been compiled through the cooperation of four institutions: Institute of Computer Science at the Polish Academy of Sciences (coordinator), Institute of Polish Language at the Polish Academy of Sciences, Polish Scientific Publishers PWN, and the Department of Computational and Corpus Linguistics at the University of Łódź.
Dr. Piotr Pęzik will introduce language tools and resources developed by the PELCRA team within the NKJP and CESAR projects, including the PELCRA search engine for NKJP, Polish and English online collocation dictionaries, phraseology exploration tools and parallel corpora.
Dr. Joanna Szwabe will present the Speech Corpus of the Polish Children (Korpus Mowy Dzieci Polskich) – an ongoing research project , which aims at collecting 120 hours of spontaneous speech of children between 3 and 6 years of age.
Dr. Przemysław Kaszubski will show the practical application of corpus work to teaching writingin EFL at academic level on the basis of PICLE – the Polish subcorpus of the International Corpus of Learner English (ICLE).