Title |
The Extended DIRNDL Corpus as a Resource for Coreference and Bridging Resolution |
Authors |
anders Björkelund, Kerstin Eckart, Arndt Riester, Nadja Schauffler and Katrin Schweitzer |
Abstract |
DIRNDL is a spoken and written corpus based on German radio news, which features coreference and information-status annotation (including bridging anaphora and their antecedents), as well as prosodic information. We have recently extended DIRNDL with a fine-grained two-dimensional information status labeling scheme. We have also applied a state-of-the-art part-of-speech and morphology tagger to the corpus, as well as highly accurate constituency and dependency parsers. In the light of this development we believe that DIRNDL is an interesting resource for NLP researchers working on automatic coreference and bridging resolution. In order to enable and promote usage of the data, we make it available for download in an accessible tabular format, compatible with the formats used in the CoNLL and SemEval shared tasks on automatic coreference resolution. |
Topics |
Corpus (Creation, Annotation, etc.), Prosody |
Full paper |
The Extended DIRNDL Corpus as a Resource for Coreference and Bridging Resolution |
Bibtex |
@InProceedings{BJRKELUND14.891,
author = {anders Björkelund and Kerstin Eckart and Arndt Riester and Nadja Schauffler and Katrin Schweitzer}, title = {The Extended DIRNDL Corpus as a Resource for Coreference and Bridging Resolution}, booktitle = {Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)}, year = {2014}, month = {may}, date = {26-31}, address = {Reykjavik, Iceland}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Hrafn Loftsson and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis}, publisher = {European Language Resources Association (ELRA)}, isbn = {978-2-9517408-8-4}, language = {english} } |