Title |
Typical Cases of Annotators Disagreement in Discourse Annotations in Prague Dependency Treebank |
Authors |
Šárka Zikánová, Lucie Mladová, Jiří Mírovský and Pavlína Jínová |
Abstract |
In this paper, we present the first results of the parallel Czech discourse annotation in the Prague Dependency Treebank 2.0. Having established an annotation scenario for capturing semantic relations crossing the sentence boundary in a discourse, and having annotated the first sections of the treebank according to these guidelines, we report now on the results of the first evaluation of these manual annotations. We give an overview of the process of the annotation itself, which we believe is to a large degree language-independent and therefore accessible to any discourse researcher. Next, we describe the inter-annotator agreement measurement, and, most importantly, we classify and analyze the most common types of annotators disagreement and propose solutions for the next phase of the annotation. The annotation is carried out on dependency trees (on the tectogrammatical layer), this approach is quite novel and it brings us some advantages when interpreting the syntactic structure of the discourse units. |
Topics |
Discourse annotation, representation and processing, Corpus (creation, annotation, etc.), Grammar and Syntax |
Full paper |
Typical Cases of Annotators Disagreement in Discourse Annotations in Prague Dependency Treebank |
Slides |
- |
Bibtex |
@InProceedings{ZIKNOV10.762,
author = {Šárka Zikánová and Lucie Mladová and Jiří Mírovský and Pavlína Jínová}, title = {Typical Cases of Annotators Disagreement in Discourse Annotations in Prague Dependency Treebank}, booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)}, year = {2010}, month = {may}, date = {19-21}, address = {Valletta, Malta}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias}, publisher = {European Language Resources Association (ELRA)}, isbn = {2-9517408-6-7}, language = {english} } |