LREC 2000: 2nd International Conference on Language Resources & Evaluation
Title | Evaluating Wordnets in Cross-language Information Retrieval: the ITEM Search Engine |
Authors | Verdejo, Felisa (Depto. de Ingeniería Eléctrica, Electrónica y de Control, UNED, Spain, felisa@ieec.uned.es); Gonzalo, Julio (Depto. de Ingeniería Eléctrica, Electrónica y de Control, UNED, Spain, julio@ieec.uned.es); Peñas, Anselmo (Depto. de Ingeniería Eléctrica, Electrónica y de Control, UNED, Spain, anselmo@ieec.uned.es); López, Fernando (Depto. de Ingeniería Eléctrica, Electrónica y de Control, UNED, Spain, flopez@ieec.uned.es); Fernández, David (Depto. de Ingeniería Eléctrica, Electrónica y de Control, UNED, Spain, david@ieec.uned.es) |
Keywords | Cross-Language Information Retrieval, Evaluation of Language Resources and Tools, Natural Language Processing and Information Retrieval |
Session | Session WP9 - Applications using Written Language Resources |
Full Paper | 250.ps, 250.pdf |
Abstract | This paper presents the ITEM multilingual search engine. This search engine performs full lexical processing (morphological analysis, tagging and Word Sense Disambiguation) on documents and queries in order to provide language-neutral indexes for querying and retrieval. The indexing terms are the EuroWordNet/ITEM InterLingual Index records that link wordnets in 10 languages of the European Community (the search engine currently supports Spanish, English and Catalan). The goal of this application is to provide a way of comparing in context the behavior of different Natural Language Processing strategies for Cross-Language Information Retrieval (CLIR) and, in particular, different Word Sense Disambiguation strategies for query translation and conceptual indexing. |
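
The idea of language-neutral indexing described in the abstract can be illustrated with a minimal sketch. This is not the ITEM implementation: the lexicon, the concept identifiers (`ili:...`), and the function names below are all hypothetical, the tokens are assumed to be already lemmatized, and no Word Sense Disambiguation is applied (every candidate sense of a word is kept).

```python
from collections import defaultdict

# Hypothetical toy lexicon: (language, lemma) -> candidate interlingual concept IDs.
# Real InterLingual Index records look different; these IDs are made up.
TOY_LEXICON = {
    ("en", "bank"): {"ili:bank-institution", "ili:bank-riverside"},
    ("es", "banco"): {"ili:bank-institution", "ili:bench"},
    ("en", "loan"): {"ili:loan"},
    ("es", "préstamo"): {"ili:loan"},
    ("ca", "préstec"): {"ili:loan"},
}


def to_concepts(tokens, lang):
    """Map lemmas to concept IDs, keeping all candidate senses (no WSD)."""
    concepts = set()
    for tok in tokens:
        concepts |= TOY_LEXICON.get((lang, tok.lower()), set())
    return concepts


def build_index(docs):
    """Build an inverted index from concept ID to document IDs."""
    index = defaultdict(set)
    for doc_id, (lang, tokens) in docs.items():
        for concept in to_concepts(tokens, lang):
            index[concept].add(doc_id)
    return index


def search(index, query_tokens, lang):
    """Return IDs of documents sharing at least one concept with the query."""
    hits = set()
    for concept in to_concepts(query_tokens, lang):
        hits |= index.get(concept, set())
    return sorted(hits)


# Documents in three languages, indexed by concept rather than by word.
docs = {
    "d1": ("en", ["bank", "loan"]),
    "d2": ("es", ["banco"]),
    "d3": ("ca", ["préstec"]),
}
index = build_index(docs)

print(search(index, ["préstamo"], "es"))
print(search(index, ["banco"], "es"))
```

In this toy setup, a Spanish query for "préstamo" reaches the English and Catalan documents through the shared concept. A query for "banco" also matches through its "bench" sense, which is the kind of ambiguity that the disambiguation strategies compared in the paper are meant to reduce.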