Title |
A treebank-based study on the influence of Italian word order on parsing performance |
Authors |
Anita Alicante, Cristina Bosco, Anna Corazza and Alberto Lavelli |
Abstract |
The aim of this paper is to contribute to the debate on the issues raised by Morphologically Rich Languages, and more precisely to investigate, in a cross-paradigm perspective, the influence of the constituent order on the data-driven parsing of one of such languages(i.e. Italian). It shows therefore new evidence from experiments on Italian, a language characterized by a rich verbal inflection, which leads to a widespread diffusion of the pro―drop phenomenon and to a relatively free word order. The experiments are performed by using state-of-the-art data-driven parsers (i.e. MaltParser and Berkeley parser) and are based on an Italian treebank available in formats that vary according to two dimensions, i.e. the paradigm of representation (dependency vs. constituency) and the level of detail of linguistic information. |
Topics |
Grammar and Syntax, Parsing, Knowledge Discovery/Representation |
Full paper |
A treebank-based study on the influence of Italian word order on parsing performance |
Bibtex |
@InProceedings{ALICANTE12.561,
author = {Anita Alicante and Cristina Bosco and Anna Corazza and Alberto Lavelli}, title = {A treebank-based study on the influence of Italian word order on parsing performance}, booktitle = {Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12)}, year = {2012}, month = {may}, date = {23-25}, address = {Istanbul, Turkey}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Mehmet Uğur Doğan and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis}, publisher = {European Language Resources Association (ELRA)}, isbn = {978-2-9517408-7-7}, language = {english} } |