Title

Multi-level XML-based Corpus Annotation

Authors

Harris Papageorgiou (Institute for Language and Speech Processing, Epidavrou & Artemidos 6, 151 25 Maroussi, Greece)

Prokopis Prokopidis (Institute for Language and Speech Processing, Epidavrou & Artemidos 6, 151 25 Maroussi, Greece)

Voula Giouli (Institute for Language and Speech Processing, Epidavrou & Artemidos 6, 151 25 Maroussi, Greece)

Iason Demiros (Institute for Language and Speech Processing, Epidavrou & Artemidos 6, 151 25 Maroussi, Greece)

Alexis Konstantinidis (Institute for Language and Speech Processing, Epidavrou & Artemidos 6, 151 25 Maroussi, Greece)

Stelios Piperidis (Institute for Language and Speech Processing, Epidavrou & Artemidos 6, 151 25 Maroussi, Greece)

Session

WP4: Corpus Annotation

Abstract

In this paper we present the methodological principles and the implementation framework of text annotation process in an Information Extraction setting. Due to the recent prevalence of XML as a means for describing structured documents in a reusable format, our team has switched to an XML based annotation schema. In that framework, an XML annotation platform has been built, while processing tools, lexical resources and textual data communicate with each other via this platform. Editing/viewing tools have been implemented, endowed with functionalities that allow annotators to gain access to previous annotation levels as well as necessary lexical resources.

Keywords

Corpus annotation, XML, Greek

Full Paper

159.pdf