LREC 2000 2nd International Conference on Language Resources & Evaluation
Conference Papers
Title | Corpora of Slovene Spoken Language for Multi-lingual Applications |
Authors |
Gros Jerneja (Faculty of Electrical Engineering, University of Ljubljana, Trzaska 25, 1001 Ljubljana, Slovenia, nejka@fe.uni-lj.si); Mihelic France (Faculty of Electrical Engineering, University of Ljubljana, Trzaska 25, 1001 Ljubljana, Slovenia, mihelicf@fe.uni-lj.si); Dobrisek Simon (Faculty of Electrical Engineering, University of Ljubljana, Laboratory of Artificial Perception, Trzaska 25, 1000 Ljubljana, Slovenia, simond@fe.uni-lj.si); Erjavec Tomaz (Dept. for Intelligent Systems, Jozef Stefan Institute, Ljubljana, Slovenia, tomaz.erjavecg@ijs.si); Zganec Mario (Masterpoint R&D, Baznikova 40, 1000 Ljubljana, Slovenia, Mario@masterpoint.si) |
Keywords | Annotation Tools, Continuous Speech, Diphone Inventory, Speech Corpus, Spoken Commands |
Session | Session SP3 - Spoken Language Resources' Projects |
Abstract | The domain of spoken language technologies ranges from speech input and output systems to complex understanding and generation systems, including multimodal systems of widely differing complexity (such as automatic dictation machines) and multilingual systems (for example, automatic dialogue and translation systems). The definition of standards and evaluation methodologies for such systems involves the specification and development of highly specific spoken language corpus and lexicon resources, as well as measurement and evaluation tools (EAGLES Handbook 1997). This paper presents the MobiLuz spoken language resources for Slovene, which will be made freely available for research purposes in speech technology and linguistics. |