Title |
Annotating a Corpus to Develop and Evaluate Discourse Entity Realization Algorithms: Issues and Preliminary Results |
Authors |
Poesio Massimo (University of Edinburgh, HCRC and Informatics, Massimo.Poesio@ed.ac.uk) |
Keywords |
Anaphora, Corpus Annotation, Empirical Methods, Evaluation, Generation, Referential Expressions |
Session |
Session WO5 - Corpus Tools |
Full Paper |
193.ps, 193.pdf |
Abstract |
We are annotating a corpus with information relevant to discourse entity realization, and especially the information needed to decide which type of NP to use. The corpus is being used to study correlations between NP type and certain semantic or discourse features, to evaluate hand-coded algorithms, and to train statistical models. We report on the development of our annotation scheme, the problems we have encountered, and the results obtained so far. |