Title |
Semantic Frame Annotation on the French MEDIA corpus |
Authors |
Marie-Jean Meurs, Frédéric Duvert, Frédéric Bechet, Fabrice Lefevre and Renato De Mori |
Abstract |
This paper introduces a knowledge representation formalism used for annotation of the French MEDIA dialogue corpus in terms of high level semantic structures. The semantic annotation, worked out according to the Berkeley FrameNet paradigm, is incremental and partially automated. We describe an automatic interpretation process for composing semantic structures from basic semantic constituents using patterns involving words and constituents. This process contains procedures which provide semantic compositions and generating frame hypotheses by inference. The MEDIA corpus is a French dialogue corpus recorded using a Wizard of Oz system simulating a telephone server for tourist information and hotel booking. It had been manually transcribed and annotated at the word and semantic constituent levels. These levels support the automatic interpretation process which provides a high level semantic frame annotation. The Frame based Knowledge Source we composed contains Frame definitions and composition rules. We finally provide some results obtained on the automatically-derived annotation. |
Language |
Single language |
Topics |
Corpus (creation, annotation, etc.), Knowledge representation, Semantics |
Full paper |
Semantic Frame Annotation on the French MEDIA corpus |
Slides |
- |
Bibtex |
@InProceedings{MEURS08.256,
author = {Marie-Jean Meurs, Frédéric Duvert, Frédéric Bechet, Fabrice Lefevre and Renato De Mori},
title = {Semantic Frame Annotation on the French MEDIA corpus},
booktitle = {Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)},
year = {2008},
month = {may},
date = {28-30},
address = {Marrakech, Morocco},
editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Daniel Tapias},
publisher = {European Language Resources Association (ELRA)},
isbn = {2-9517408-4-0},
note = {http://www.lrec-conf.org/proceedings/lrec2008/},
language = {english}
} |