Title

MiniCors and Cast3LB: Two Semantically Tagged Spanish Corpora

Author(s)

M. Taulé  (1); M. Civit (1); N. Artigas (1); M. García (1); L. Márquez (2); M. A. Martí (1); B. Navarro (3).

(1) CLiC, Centre de Llenguatge i Computació-Universitat de Barcelona; (2) TALP, Departament LSI-Universitat Politècnica de Catalunya; (3) Departament LSI-Universitat d’Alacant

Session

P19-SW

Abstract

In this paper we present two Spanish corpora, MiniCors and Cast3LB, semantically tagged according to different annotation criteria and objectives. In order to guarantee the quality of the results, we have established a methodology for the development of these corpora. The resulting resources consist of a semantically tagged corpus according to the lexical sample task, and a semantically tagged corpus according to the all words task, both of them defined within the Senseval framework.

Keyword(s)

Corpus Linguistics, Semantic Annotation, Word Sense Disambiguation

Language(s) Spanish
Full Paper

121.pdf