Summary of the paper

Title A Corpus of Literal and Idiomatic Uses of German Infinitive-Verb Compounds
Authors Andrea Horbach, Andrea Hensler, Sabine Krome, Jakob Prange, Werner Scholze-Stubenrecht, Diana Steffen, Stefan Thater, Christian Wellner and Manfred Pinkal
Abstract We present an annotation study on a representative dataset of literal and idiomatic uses of German infinitive-verb compounds in newspaper and journal texts. Infinitive-verb compounds form a challenge for writers of German, because spelling regulations are different for literal and idiomatic uses. Through the participation of expert lexicographers we were able to obtain a high-quality corpus resource which offers itself as a testbed for automatic idiomaticity detection and coarse-grained word-sense disambiguation. We trained a classifier on the corpus which was able to distinguish literal and idiomatic uses with an accuracy of 85 %.
Topics Corpus (Creation, Annotation, etc.), Semantics, Word Sense Disambiguation
Full paper A Corpus of Literal and Idiomatic Uses of German Infinitive-Verb Compounds
Bibtex @InProceedings{HORBACH16.229,
  author = {Andrea Horbach and Andrea Hensler and Sabine Krome and Jakob Prange and Werner Scholze-Stubenrecht and Diana Steffen and Stefan Thater and Christian Wellner and Manfred Pinkal},
  title = {A Corpus of Literal and Idiomatic Uses of German Infinitive-Verb Compounds},
  booktitle = {Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)},
  year = {2016},
  month = {may},
  date = {23-28},
  location = {Portorož, Slovenia},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Sara Goggi and Marko Grobelnik and Bente Maegaard and Joseph Mariani and Helene Mazo and Asuncion Moreno and Jan Odijk and Stelios Piperidis},
  publisher = {European Language Resources Association (ELRA)},
  address = {Paris, France},
  isbn = {978-2-9517408-9-1},
  language = {english}
 }
Powered by ELDA © 2016 ELDA/ELRA