Title

Portuguese Large-scale Language Resources for NLP Applications

Author(s)

Elisabete Ranchhod (1), Paula Carvalho (1), Cristina Mota (2), Anabela Barreiro (1)

(1) Universidade de Lisboa and LabEL (CAUTL/IST), (2) LabEL (CAUTL/IST)

Session

P19-SW

Abstract

The paper describes large-scale linguistic resources for Portuguese, mainly computational lexicons and grammars, developed by LabEL. These resources are formalized and applied to texts by means of finite-state techniques, which are increasingly adopted in Natural Language Processing. The paper first illustrates methods of lexical representation for simple words and multi-word expressions; it then provides examples, in the form of concordances, of linguistic structures recognized after disambiguation and parsing grammars are applied to texts. It ends with a short reference to the publicly available data, highlighting its contribution to the dissemination of LabEL’s knowledge on language technology.

Keyword(s)

Language Resources for Language Engineering, Lexical Resources, Linguistic-based Corpus Processing, MWE – Identification and Tagging, Word Sense Disambiguation, Finite-State Parsing

Language(s)

Portuguese

Full Paper

102.pdf