Title |
Whats in a Colour? Studying and Contrasting Colours with COMPARA |
Authors |
Diana Santos, Maria do Rosário Silva and Susana Inácio |
Abstract |
In this paper we present contrastive colour studies done using COMPARA, the largest edited parallel corpus in the world (as far as we know). The studies were the result of semantic annotation of the corpus in this domain. We chose to start with colour because it is a relatively contained lexical category and the subject of many arguments in linguistics. We begin by explaining the criteria involved in the annotation process, not only for the colour categories but also for the colour groups created in order to do finer-grained analyses, presenting also some quantitative data regarding these categories and groups. We proceed to compare the two languages according to the diversity of available lexical items, morphological and syntactic properties, and then try to understand the translation of colour. We end by explaining how any user who wants to do serious studies using the corpus can collaborate in enhancing the corpus and making their semantic annotations widely available as well. |
Language |
Multiple languages |
Topics |
Corpus (creation, annotation, etc.), Semantics, Multilinguality |
Full paper |
Whats in a Colour? Studying and Contrasting Colours with COMPARA |
Slides |
- |
Bibtex |
@InProceedings{SANTOS08.73,
author = {Diana Santos, Maria do Rosário Silva and Susana Inácio},
title = {Whats in a Colour? Studying and Contrasting Colours with COMPARA},
booktitle = {Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)},
year = {2008},
month = {may},
date = {28-30},
address = {Marrakech, Morocco},
editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Daniel Tapias},
publisher = {European Language Resources Association (ELRA)},
isbn = {2-9517408-4-0},
note = {http://www.lrec-conf.org/proceedings/lrec2008/},
language = {english}
} |