Title |
Experiments in Human-computer Cooperation for the Semantic Annotation of Portuguese Corpora |
Authors |
Diana Santos and Cristina Mota |
Abstract |
In this paper, we present a system to aid human annotation of semantic information in the scope of the project AC/DC, called corte-e-costura. This system leverages on the human annotation effort, by providing the annotator with a simple system that applies rules incrementally. Our goal was twofold: first, to develop an easy-to-use system that required a minimum of learning from the part of the linguist; second, one that provided a straightforward way of checking the results obtained, in order to immediately evaluate the results of the rules devised. After explaining the motivation for its development from scratch, we present the current status of the AC/DC project and provide a quantitative description of its material in what concerns semantic annotation. We then present the corte-e-costura system in detail, providing the result of our first experiments with the semantic fields of colour and clothing. We end the paper with some discussion of future work as well as of the experience gained. |
Topics |
Corpus (creation, annotation, etc.), Semantics, LR national/international projects, organizational/policy issues |
Full paper |
Experiments in Human-computer Cooperation for the Semantic Annotation of Portuguese Corpora |
Slides |
- |
Bibtex |
@InProceedings{SANTOS10.457,
author = {Diana Santos and Cristina Mota}, title = {Experiments in Human-computer Cooperation for the Semantic Annotation of Portuguese Corpora}, booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)}, year = {2010}, month = {may}, date = {19-21}, address = {Valletta, Malta}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias}, publisher = {European Language Resources Association (ELRA)}, isbn = {2-9517408-6-7}, language = {english} } |