Title

VIQTORYA -- A Visual Query Tool for Syntactically Annotated Corpora

Authors

Ilona Steiner (Seminar für Sprachwissenschaft, Universität Tübingen, Wilhelmstr. 113, D-72074 Tübingen, Germany)

Laura Kallmeyer (TALaNa-Lattice, Université Paris 7, 2, place Jussieu, F-75251 Paris cedex 05, France)

Session

WP4: Corpus Annotation

Abstract

This paper presents a query tool for syntactically annotated corpora. The tool was developed to search the Tübingen Treebanks, which are annotated at the University of Tübingen, but in principle it can also be adapted to other corpora. It uses a query language that allows users to search for tokens, syntactic categories, grammatical functions, and binary relations of (immediate) dominance and linear precedence between nodes. The overall idea is to extract the relevant information from the corpus in an initialization phase and store it compactly in a relational database. An incoming query is then translated into a corresponding SQL query, which is evaluated on the database. A graphical user interface allows users to specify queries in a user-friendly way.
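To make the database approach concrete, the following is a minimal sketch of how dominance and precedence relations between nodes might be answered with SQL. It is not the schema or the query language of VIQTORYA, which the abstract does not specify: the `node` table, its columns, the toy tree, and the function names are all assumptions made for illustration, and the sketch stores a single tree only (a real corpus would also need a sentence identifier).

```python
import sqlite3

# Hypothetical toy tree: (id, label, parent id, first token, last token).
# Token positions are 0-based; this layout is an assumption, not the paper's schema.
TOY_TREE = [
    (1, "S", None, 0, 3),
    (2, "NP", 1, 0, 1),
    (3, "VP", 1, 2, 3),
    (4, "V", 3, 2, 2),
    (5, "NP", 3, 3, 3),
]


def build_database(nodes):
    """Initialization phase: store the node rows in an in-memory database."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE node (id INTEGER PRIMARY KEY, label TEXT, "
        "parent INTEGER, left_pos INTEGER, right_pos INTEGER)"
    )
    conn.executemany("INSERT INTO node VALUES (?, ?, ?, ?, ?)", nodes)
    return conn


def immediately_dominates(conn, parent_label, child_label):
    """Return (parent id, child id) pairs where the parent is the child's mother node."""
    sql = (
        "SELECT p.id, c.id FROM node AS p "
        "JOIN node AS c ON c.parent = p.id "
        "WHERE p.label = ? AND c.label = ? ORDER BY p.id, c.id"
    )
    return conn.execute(sql, (parent_label, child_label)).fetchall()


def precedes(conn, first_label, second_label):
    """Return (a id, b id) pairs where node a ends before node b begins."""
    sql = (
        "SELECT a.id, b.id FROM node AS a "
        "JOIN node AS b ON a.right_pos < b.left_pos "
        "WHERE a.label = ? AND b.label = ? ORDER BY a.id, b.id"
    )
    return conn.execute(sql, (first_label, second_label)).fetchall()


if __name__ == "__main__":
    conn = build_database(TOY_TREE)
    print(immediately_dominates(conn, "VP", "NP"))
    print(precedes(conn, "NP", "V"))
```

In this sketch, immediate dominance becomes a self-join on the parent column and linear precedence becomes a comparison of token spans, so each binary relation in a query maps to one join condition. Non-immediate dominance would need an additional encoding (for example a transitive closure table), which is left out here.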

Keywords

Query tool, Query language, Treebank, Linguistic database, Syntactic annotation

Full Paper

116.pdf