LREC 2000 2nd International Conference on Language Resources & Evaluation | ||||||
Title | A Treebank of Spanish and its Application to Parsing |
Authors | Moreno Antonio (Universidad de Málaga, F. Filosofía y Letras, Campus de Teatinos, 29071 Málaga, Spain, amo@uma.es) Grishman Ralph (Department of Computer Science, New York University, U.S.A, grishman@cs.nyu.edu) López Susana (Laboratorio de Lingüística Informática, Universidad Autónoma de Madrid, Spain, susana@maria.lllf.uam.es) Sánchez Fernando (Laboratorio de Lingüística Informática, Universidad Autónoma de Madrid, Spain, fernando@maria.lllf.uam.es) Sekine Satoshi (Department of Computer Science, New York University, U.S.A, sekine@cs.nyu.edu) |
Keywords | Grammar Acquisition, Parsing, Spanish, Syntax, Treebank |
Session | Session WO2 - Treebanks |
Full Paper | 66.ps, 66.pdf |
Abstract | This paper presents joint research between a Spanish team and an American one on the development and exploitation of a Spanish treebank. Such treebanks for other languages have proven valuable for the development of high-quality parsers and for a wide variety of language studies. However, when the project started, at the end of 1997, there was no syntactically annotated corpus for Spanish. This paper describes the design of such a treebank and its initial application to parser construction. |