Title |
Ontology-Based XQuerying of XML-Encoded Language Resources on Multiple Annotation Layers |
Authors |
Georg Rehm, Richard Eckart, Christian Chiarcos and Johannes Dellert |
Abstract |
We present an approach for querying collections of heterogeneous linguistic corpora that are annotated on multiple layers using arbitrary XML-based markup languages. An OWL ontology provides a homogenising view on the conceptually different markup languages so that a common querying framework can be established using the method of ontology-based query expansion. In addition, we present a highly flexible web-based graphical interface that can be used to query corpora with regard to several different linguistic properties such as syntactic tree fragments. This interface can also be used for ontology-based querying of multiple corpora simultaneously. |
Language |
Language-independent |
Topics |
Corpus (creation, annotation, etc.), LR Infrastructures and Architectures, Ontologies |
Full paper |
Ontology-Based XQuerying of XML-Encoded Language Resources on Multiple Annotation Layers |
Slides |
Ontology-Based XQuerying of XML-Encoded Language Resources on Multiple Annotation Layers |
Bibtex |
@InProceedings{REHM08.139,
author = {Georg Rehm and Richard Eckart and Christian Chiarcos and Johannes Dellert},
title = {Ontology-Based XQuerying of XML-Encoded Language Resources on Multiple Annotation Layers},
booktitle = {Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)},
year = {2008},
month = may,
date = {28-30},
address = {Marrakech, Morocco},
editor = {{Nicoletta Calzolari (Conference Chair)} and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Daniel Tapias},
publisher = {European Language Resources Association (ELRA)},
isbn = {2-9517408-4-0},
note = {http://www.lrec-conf.org/proceedings/lrec2008/},
language = {english}
} |