Title |
A Unified Annotation Scheme for the Semantic/Pragmatic Components of Definiteness |
Authors |
Archna Bhatia, Mandy Simons, Lori Levin, Yulia Tsvetkov, Chris Dyer and Jordan Bender |
Abstract |
"We present a definiteness annotation scheme that captures the semantic, pragmatic, and discourse information, which we call communicative functions, associated with linguistic descriptions such as ""a story about my speech"", ""the story"", ""every time I give it"", ""this slideshow"". A survey of the literature suggests that definiteness does not express a single communicative function but is a grammaticalization of many such functions, for example, identifiability, familiarity, uniqueness, specificity. Our annotation scheme unifies ideas from previous research on definiteness while attempting to remove redundancy and make it easily annotatable. This annotation scheme encodes the communicative functions of definiteness rather than the grammatical forms of definiteness. We assume that the communicative functions are largely maintained across languages while the grammaticalization of this information may vary. One of the final goals is to use our semantically annotated corpora to discover how definiteness is grammaticalized in different languages. We release our annotated corpora for English and Hindi, and sample annotations for Hebrew and Russian, together with an annotation manual." |
Topics |
Semantics, Machine Translation, SpeechToSpeech Translation |
Full paper |
A Unified Annotation Scheme for the Semantic/Pragmatic Components of Definiteness |
Bibtex |
@InProceedings{BHATIA14.1194,
author = {Archna Bhatia and Mandy Simons and Lori Levin and Yulia Tsvetkov and Chris Dyer and Jordan Bender}, title = {A Unified Annotation Scheme for the Semantic/Pragmatic Components of Definiteness}, booktitle = {Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)}, year = {2014}, month = {may}, date = {26-31}, address = {Reykjavik, Iceland}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Hrafn Loftsson and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis}, publisher = {European Language Resources Association (ELRA)}, isbn = {978-2-9517408-8-4}, language = {english} } |