LREC 2000 2nd International Conference on Language Resources & Evaluation
Title | Named Entity Recognition in Greek Texts |
Authors | Demiros Iason (Institute for Language and Speech Processing, Artemidos 6 & Epidavrou, 151 25, Athens, Greece, email: iason@ilsp.gr); Boutsis Sotiris (Institute for Language and Speech Processing, Artemidos 6 & Epidavrou, 151 25 Maroussi, Greece, email: sboutsis@ilsp.gr); Giouli Voula (Institute for Language and Speech Processing, Artemidos 6 & Epidavrou, 151 25, Athens, Greece, tel: +301 6875300, fax: +301 6854270, email: voula@ilsp.gr); Liakata Maria (Cambridge University, email: ml257@cam.ac.uk); Papageorgiou Harris (Institute for Language and Speech Processing, Epidavrou & Artemidos 6, 151 25 Maroussi, Greece, email: xaris@ilsp.gr); Piperidis Stelios (Institute for Language and Speech Processing, Artemidos 6 & Epidavrou, 151 25, Athens, Greece, tel: +301 6875300, fax: +301 6854270, email: spip@ilsp.gr) |
Keywords | Greek, Information Extraction, Named Entity Recognition |
Session | Session WO14 - Named Entity Recognition |
Full Paper | 173.ps, 173.pdf |
Abstract | In this paper, we describe work in progress on the development of a named entity recognizer for Greek. The system is aimed at information extraction applications that require large-scale text processing. Speed of analysis, system robustness, and accuracy of results have been the basic guidelines for the system’s design. Our system is an automated pipeline of linguistic components for Greek text processing, based on pattern matching techniques. Non-recursive regular expressions have been implemented on top of this pipeline in order to capture different types of named entities. For development and testing purposes, we collected a corpus of financial texts from several web sources and manually annotated part of it. Overall precision and recall are 86% and 81%, respectively. |
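
The abstract describes capturing named entities with non-recursive regular expressions and scoring the output by precision and recall. The following is a minimal sketch of that general idea, not the authors' implementation. The patterns, the entity labels (`ORG`, `MONEY`, `DATE`, `PERSON`), the helper names, and the sample sentence are all hypothetical. The sketch also assumes Latin-script toy text for readability and matches on raw text rather than on the output of a linguistic pipeline for Greek.

```python
import re

# Hypothetical patterns for illustration only; not the grammar used in the paper.
# Each one is a flat (non-recursive) regular expression.
PATTERNS = {
    "ORG": re.compile(r"\b[A-Z]\w+(?: [A-Z]\w+)* (?:S\.A\.|Ltd|Inc)"),
    "MONEY": re.compile(r"\b\d+(?:[.,]\d+)* (?:euro|drachmas|USD)\b"),
    "DATE": re.compile(r"\b\d{1,2}/\d{1,2}/\d{4}\b"),
}


def find_entities(text):
    """Return a set of (start, end, label) spans matched by any pattern."""
    spans = set()
    for label, pattern in PATTERNS.items():
        for match in pattern.finditer(text):
            spans.add((match.start(), match.end(), label))
    return spans


def span(text, phrase, label):
    """Build a (start, end, label) span for the first occurrence of phrase."""
    start = text.index(phrase)
    return (start, start + len(phrase), label)


def precision_recall(predicted, gold):
    """Exact-match precision and recall over sets of labelled spans."""
    correct = len(predicted & gold)
    precision = correct / len(predicted) if predicted else 0.0
    recall = correct / len(gold) if gold else 0.0
    return precision, recall


# Toy sentence and a hand-made gold annotation (both invented for this sketch).
text = "Alpha Bank S.A. reported profits of 1.200 euro on 15/03/2000, said John Smith."
gold = {
    span(text, "Alpha Bank S.A.", "ORG"),
    span(text, "1.200 euro", "MONEY"),
    span(text, "15/03/2000", "DATE"),
    span(text, "John Smith", "PERSON"),  # no pattern covers this label
}

predicted = find_entities(text)
precision, recall = precision_recall(predicted, gold)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

Here precision is the share of predicted spans that appear in the gold annotation, and recall is the share of gold spans that the patterns recover. Exact span-and-label matching is an assumption of this sketch; the abstract does not state which matching criterion underlies the reported 86% and 81%.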