Title |
A Recursive Annotation Scheme for Referential Information Status |
Authors |
Arndt Riester, David Lorenz and Nina Seemann |
Abstract |
We provide a robust and detailed annotation scheme for information status, which is easy to use, follows a semantic rather than cognitive motivation, and achieves reasonable inter-annotator scores. Our annotation scheme is based on two main assumptions: firstly, that information status strongly depends on (in)definiteness, and secondly, that it ought to be understood as a property of referents rather than words. Therefore, our scheme banks on overt (in)definiteness marking and provides different categories for each class. Definites are grouped according to the information source by which the referent is identified. A special aspect of the scheme is that non-anaphoric expressions (e.g.\ names) are classified as to whether their referents are likely to be known or unknown to an expected audience. The annotation scheme provides a solution for annotating complex nominal expressions which may recursively contain embedded expressions. In annotating a corpus of German radio news bulletins, a kappa score of .66 for the full scheme was achieved, a core scheme of six top-level categories yields kappa = .78. |
Topics |
Anaphora, Coreference, Corpus (creation, annotation, etc.), Discourse annotation, representation and processing |
Full paper |
A Recursive Annotation Scheme for Referential Information Status |
Slides |
A Recursive Annotation Scheme for Referential Information Status |
Bibtex |
@InProceedings{RIESTER10.764,
author = {Arndt Riester and David Lorenz and Nina Seemann}, title = {A Recursive Annotation Scheme for Referential Information Status}, booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)}, year = {2010}, month = {may}, date = {19-21}, address = {Valletta, Malta}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias}, publisher = {European Language Resources Association (ELRA)}, isbn = {2-9517408-6-7}, language = {english} } |