LREC 2000 2nd International Conference on Language Resources & Evaluation
 

Previous Paper   Next Paper

Title Using Few Clues Can Compensate the Small Amount of Resources Available for Word Sense Disambiguation
Authors de Loupy Claude (Laboratoire Informatique d’Avignon, B.P. 1228, Agroparc, 339 chemin des Meinajaries, 84911 Avignon, cedex 9, FRANCE, claude.de.loupy@lia.univ-avignon.fr)
El-Bèze Marc (Laboratoire Informatique d’Avignon, B.P. 1228, Agroparc, 339 chemin des Meinajaries, 84911 Avignon, cedex 9, FRANCE, marc.elbeze@lia.univ-avignon.fr)
Keywords Probabilistic Method, Size-Limited Resources, Word Sense Disambiguation
Session Session WO5 - Corpus Tools
Full Paper 350.ps, 350.pdf
Abstract Word Sense Disambiguation (WSD) is considered as one of the most difficult tasks in Natural Language Processing. Probabilistic methods have shown their efficiency in many NLP tasks, but they imply a training phase and very few resources are available for WSD. This paper aims at showing how to make the most of size-limited resources in order to partially overcome the knowledge acquisition bottleneck. Experiments are performed within the SENSEVAL test framework in order to evaluate the advantage of a lemmatized or stemmed context over an original context (inflected forms as they are observed in the rough text). Then, we measure the precision improvement (about 6 %) when looking at the inflected form of the word to be disambiguated. Lastly, we show that it is possible to reduce the ambiguity if the word to be disambiguated has a particular inflected form or occurs as part of a compound.