Title

A pattern extraction workbench combining multiple linguistic levels

Author(s)

Magnus Merkel, Andreas Lange

Linköping University, S-581 83 Linköping, Sweden

Session

P5-W

Abstract

In this paper an interactive pattern extraction workbench, I*Pex, is presented. The workbench comes in a graphical environment and is designed to be used in an incremental and interactive fashion with the user. Patterns can be constructed to work in combination involving specifications on several linguistic levels simultaneously, from the character level using regular expressions, parts of speech and dependency relations to semantic roles. The input text format is based on XCES XML format.

Keyword(s)

Information extraction, pattern extraction, XML, multiple linguistic levels, interactivity, incremental development.

Language(s)

English, Swedish.

Full Paper

336.pdf