Title

Discarding noise in an automatically acquired lexicon of support verb constructions

Author(s)

M. Begoña Villada Moirón

Humanities Computing. University of Groningen

Session

P21-W

Abstract

We applied data-driven methods to carry out automatic acquisition of Dutch prepositional support verb constructions (SVCs) in corpora (e.g., iets in de gaten houden (``keep an eye on something'')). This paper addresses the question whether linguistic diagnostics help to discard noise from the nbest lists and how to (semi-)automatically apply such linguistic diagnostics to parsed corpora. We show that some of the linguistic diagnostics proposed in Hollebrandse (1993) effectively identify SVCs and contribute a modest error rate decrease.

Keyword(s)

support verb constructions, automatic acquisition, noise filtering

Language(s)

Dutch

Full Paper

442.pdf