Title |
Towards Spanish Verbs Selectional Preferences Automatic Acquisition: Semantic Annotation of the SenSem Corpus |
Authors |
Jordi Carrrera, Irene Castellón, Salvador Climent and Marta Coll-Florit |
Abstract |
We present the results of an agreement task carried out in the framework of the KNOW Project and consisting in manually annotating an agreement sample totaling 50 sentences extracted from the SenSem corpus. Diambiguation was carried out for all nouns, proper nouns and adjectives in the sample, all of which were assigned EuroWordNet (EWN) synsets. As a result of the task, Spanish WN has been shown to exhibit 1) lack of explanatory clarity (it does not define word meanings, but glosses and examplifies them instead; it does not systematically encode metaphoric meanings, either); 2) structural inadequacy (some words appear as hyponyms of another sense of the same word; sometimes there even coexist in Spanish WN a general sense and a specific one related to the same concept, but with no structural link in between; hyperonymy relationships have been detected that are likely to raise doubts to human annotators; there can even be found cases of auto-hyponymy); 3) cross-linguistic inconsistency (there exist in English EWN concepts whose lexical equivalent is missing in Spanish WN; glosses in one language more often than not contradict or diverge from glosses in another language). |
Language |
Single language |
Topics |
Corpus (creation, annotation, etc.), Word Sense Disambiguation, Semantics |
Full paper |
Towards Spanish Verbs Selectional Preferences Automatic Acquisition: Semantic Annotation of the SenSem Corpus |
Slides |
- |
Bibtex |
@InProceedings{CARRRERA08.604,
author = {Jordi Carrrera, Irene Castellón, Salvador Climent and Marta Coll-Florit},
title = {Towards Spanish Verbs Selectional Preferences Automatic Acquisition: Semantic Annotation of the SenSem Corpus},
booktitle = {Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)},
year = {2008},
month = {may},
date = {28-30},
address = {Marrakech, Morocco},
editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Daniel Tapias},
publisher = {European Language Resources Association (ELRA)},
isbn = {2-9517408-4-0},
note = {http://www.lrec-conf.org/proceedings/lrec2008/},
language = {english}
} |