Title

A powerful and versatile XML format for representing role-semantic annotation

Author(s)

Katrin Erk, Sebastian Padó

Computational Linguistics, Saarland University, Saarbrücken, Germany

Session

O20-W

Abstract

We present two XML formats for the description and encoding of semantic role information in corpora. The TIGER/SALSA XML format provides a modular representation for semantic roles and syntactic structure. The Text-SALSA XML format is a lightweight version of TIGER/SALSA XML designed for manual annotation with an XML editor rather than a special tool. Both formats can deal with underspecification, roles crossing the sentence boundary, compound splitting, and whole-sentence tags for meta-level comments.

Keyword(s)

semantic roles, XML, representation, multi-level annotation, corpora

Language(s) German, English
Full Paper

202.pdf