Title |
Annotation of Discourse Relations for Conversational Spoken Dialogs |
Authors |
Sara Tonelli, Giuseppe Riccardi, Rashmi Prasad and Aravind Joshi |
Abstract |
In this paper, we make a qualitative and quantitative analysis of discourse relations within the LUNA conversational spoken dialog corpus. In particular, we first describe the Penn Discourse Treebank (PDTB) and then we detail the adaptation of its annotation scheme to the LUNA corpus of Italian task-oriented dialogs in the domain of software/hardware assistance. We discuss similarities and differences between our approach and the PDTB paradigm and point out the peculiarities of spontaneous dialogs w.r.t. written text, which motivated some changes in the annotation strategy. In particular, we introduced the annotation of relations between non-contiguous arguments and we modified the sense hierarchy in order to take into account the important role of pragmatics in dialogs. In the final part of the paper, we present a comparison between the sense and connective frequency in a representative subset of the LUNA corpus and in the PDTB. Such analysis confirmed the differences between the two corpora and corroborates our choice to introduce dialog-specific adaptations. |
Topics |
Dialogue, Corpus (creation, annotation, etc.), Discourse annotation, representation and processing |
Full paper |
Annotation of Discourse Relations for Conversational Spoken Dialogs |
Slides |
- |
Bibtex |
@InProceedings{TONELLI10.184,
author = {Sara Tonelli and Giuseppe Riccardi and Rashmi Prasad and Aravind Joshi}, title = {Annotation of Discourse Relations for Conversational Spoken Dialogs}, booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)}, year = {2010}, month = {may}, date = {19-21}, address = {Valletta, Malta}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias}, publisher = {European Language Resources Association (ELRA)}, isbn = {2-9517408-6-7}, language = {english} } |