Title |
A Description of Morphological Features of Serbian: a Revision using Feature System Declaration |
Authors |
Cvetana Krstev, Ranka Stanković and Duško Vitas |
Abstract |
In this paper we discuss some well-known morphological descriptions used in various projects and applications (most notably MULTEXT-East and Unitex) and illustrate the encountered problems on Serbian. We have spotted four groups of problems: the lack of a value for an existing category, the lack of a category, the interdependence of values and categories lacking some description, and the lack of a support for some types of categories. At the same time, various descriptions often describe exactly the same morphological property using different approaches. We propose a new morphological description for Serbian following the feature structure representation defined by the ISO standard. In this description we try do incorporate all characteristics of Serbian that need to be specified for various applications. We have developed several XSLT scripts that transform our description into descriptions needed for various applications. We have developed the first version of this new description, but we treat it as an ongoing project because for some properties we have not yet found the satisfactory solution. |
Topics |
Morphology, Lexicon, lexical database, Standards for LRs |
Full paper |
A Description of Morphological Features of Serbian: a Revision using Feature System Declaration |
Slides |
- |
Bibtex |
@InProceedings{KRSTEV10.66,
author = {Cvetana Krstev and Ranka Stanković and Duško Vitas}, title = {A Description of Morphological Features of Serbian: a Revision using Feature System Declaration}, booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)}, year = {2010}, month = {may}, date = {19-21}, address = {Valletta, Malta}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias}, publisher = {European Language Resources Association (ELRA)}, isbn = {2-9517408-6-7}, language = {english} } |