LREC 2000 2nd International Conference on Language Resources & Evaluation
 

Previous Paper   Next Paper

Title Electronic Language Resources for Polish: POLEX, CEGLEX and GRAMLEX
Authors Vetulani Zygmunt (Adam Mickiewicz University, Department of Computer Linguistics and Artificial Intelligence, ul. Matejki 48/49, PL-60769 Pozna, Poland, http://main.amu.edu.pl/~vetulani, vetulani@amu.edu.pl)
Keywords Dictionary Formats, Electronic Dictionaries, NLP Tools, Polish Morphology, Resources
Session Session WP1 - Lexicon
Full Paper 62.ps, 62.pdf
Abstract We present theoretical results and resources obtained within three projects: national project POLEX, Copernicus 1 Project CEGLEX (1032) and Copernicus Project GRAMLEX (632). Morphological resources obtained within these projects contribute to fill-in the gap on the map of available electronic language resources for Polish. After a short presentation of some common methodological bases defined within the POLEX project, we proceed to present methodology and data obtained in CEGLEX and GRAMLEX projects. The intention of the Polish language part of CEGLEX was to test formats proposed by the GENELEX project against Polish data. The aim of the GRAMLEX project was to create a corpus-based morphological resources for Polish. GRAMLEX refers directly to the morphological part of the CEGLEX project. Large samples of data presented here are accessible at http://main.amu.edu.pl/~zlisi/projects.htm.