Title |
Interoperability and Customisation of Annotation Schemata in Argo |
Authors |
Rafal Rak, Jacob Carter, andrew Rowley, Riza Theresa Batista-Navarro and Sophia Ananiadou |
Abstract |
The process of annotating text corpora involves establishing annotation schemata which define the scope and depth of an annotation task at hand. We demonstrate this activity in Argo, a Web-based workbench for the analysis of textual resources, which facilitates both automatic and manual annotation. Annotation tasks in the workbench are defined by building workflows consisting of a selection of available elementary analytics developed in compliance with the Unstructured Information Management Architecture specification. The architecture accommodates complex annotation types that may define primitive as well as referential attributes. Argo aids the development of custom annotation schemata and supports their interoperability by featuring a schema editor and specialised analytics for schemata alignment. The schema editor is a self-contained graphical user interface for defining annotation types. Multiple heterogeneous schemata can be aligned by including one of two type mapping analytics currently offered in Argo. One is based on a simple mapping syntax and, although limited in functionality, covers most common use cases. The other utilises a well established graph query language, SPARQL, and is superior to other state-of-the-art solutions in terms of expressiveness. We argue that the customisation of annotation schemata does not need to compromise their interoperability. |
Topics |
Corpus (Creation, Annotation, etc.), LR Infrastructures and Architectures |
Full paper |
Interoperability and Customisation of Annotation Schemata in Argo |
Bibtex |
@InProceedings{RAK14.1086,
author = {Rafal Rak and Jacob Carter and andrew Rowley and Riza Theresa Batista-Navarro and Sophia Ananiadou}, title = {Interoperability and Customisation of Annotation Schemata in Argo}, booktitle = {Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)}, year = {2014}, month = {may}, date = {26-31}, address = {Reykjavik, Iceland}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Hrafn Loftsson and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis}, publisher = {European Language Resources Association (ELRA)}, isbn = {978-2-9517408-8-4}, language = {english} } |