LREC 2000 2nd International Conference on Language Resources & Evaluation | ||||||
Title | Resources for Multilingual Text Generation in Three Slavic Languages |
Authors | Bateman John A. (Linguistics and Literature, University of Bremen, bateman@uni-bremen.de) Teich Elke (Institute for Applied Linguistics, Translation and Interpretation, University of the Saarland, e.teich@mx.uni-saarland.de) Kruijff Geert-Jan (Institute of Formal and Applied Linguistics, Charles University, gj@ufal.mff.cuni.cz) Kruijff-Korbayová Ivanna (Institute of Formal and Applied Linguistics, Charles University, korbay@ufal.mff.cuni.cz) Sharoff Serge (Russian Research Institute for, Artificial Inteligence, sharoff@aha.ru) Skoumalová Hana (Institute of Theoretical and Computational Linguistics, Charles University, hana.skoumalova@ff.cuni.cz) |
Keywords | Generation, Multilingual LR, Reusing LR, Text |
Session | Session WP9 - Applications using Written Language Resources |
Full Paper | 241.ps, 241.pdf |
Abstract | The paper discusses the methods followed to re-use a large-scale, broad-coverage English grammar for constructing similar scale grammars for Bulgarian, Czech and Russian for the fast prototyping of a multilingual generation system. We present (1) the theoretical and methodological basis for resource sharing across languages, (2) the use of a corpus-based contrastive register analysis, in particular, contrastive analysis of mood and agency. Because the study concerns reuse of the grammar of a language that is typologically quite different from the languages treated, the issues addressed in this paper appear relevant to a wider range of researchers in need of large-scale grammars for less-researched languages. |