SUMMARY : Session P2-W

 

Title Annotating COMPARA, a Grammar-aware Parallel Corpus
Authors D. Santos, S. Inácio
Abstract In this paper we describe the annotation of COMPARA, currently the largest post-edited parallel corpora which include Portuguese. We describe the motivation, the results so far, and the way the corpus is being annotated. We also provide the first grounded results about syntactical ambiguity in Portuguese. Finally, we discuss some interesting problems in this connection.
Keywords corpus annotation, parallel corpora, categorial ambiguity, corpus search, annotaion guidelines, vagueness
Full paper Annotating COMPARA, a Grammar-aware Parallel Corpus