LREC 2000 2nd International Conference on Language Resources & Evaluation | |
Conference Papers
Papers by paper title: A B C D E F G H I J K L M N O P Q R S T U V W X Y Z Papers by ID number: 1-50, 51-100, 101-150, 151-200, 201-250, 251-300, 301-350, 351-377. |
Previous Paper Next Paper
Title | Using Few Clues Can Compensate the Small Amount of Resources Available for Word Sense Disambiguation |
Authors |
de Loupy Claude (Laboratoire Informatique d’Avignon, B.P. 1228, Agroparc, 339 chemin des Meinajaries, 84911 Avignon, cedex 9, FRANCE, claude.de.loupy@lia.univ-avignon.fr) El-Beze Marc (Laboratoire Informatique d’Avignon, B.P. 1228, Agroparc, 339 chemin des Meinajaries, 84911 Avignon, cedex 9, FRANCE, marc.elbeze@lia.univ-avignon.fr) |
Keywords | Probabilistic Method, Size-Limited Resources, Word Sense Disambiguation |
Session | Session WO5 - Corpus Tools |
Abstract | Word Sense Disambiguation (WSD) is considered as one of the most difficult tasks in Natural Language Processing. Probabilistic methods have shown their efficiency in many NLP tasks, but they imply a training phase and very few resources are available for WSD. This paper aims at showing how to make the most of size-limited resources in order to partially overcome the knowledge acquisition bottleneck. Experiments are performed within the SENSEVAL test framework in order to evaluate the advantage of a lemmatized or stemmed context over an original context (inflected forms as they are observed in the rough text). Then, we measure the precision improvement (about 6 %) when looking at the inflected form of the word to be disambiguated. Lastly, we show that it is possible to reduce the ambiguity if the word to be disambiguated has a particular inflected form or occurs as part of a compound. |