SUMMARY : Session O36-W Semantically Annotated Corpora

 

Title The SALSA Corpus: a German Corpus Resource for Lexical Semantics
Authors A. Burchardt, K. Erk, A. Frank, A. Kowalski, S. Pado
Abstract This paper describes the SALSA corpus, a large German corpus manually annotated with manual role-semantic annotation, based on the syntactically annotated TIGER newspaper corpus. The first release, comprising about 20,000 annotated predicate instances (about half the TIGER corpus), is scheduled for mid-2006. In this paper we discuss the annotation framework (frame semantics) and its cross-lingual applicability, problems arising from exhaustive annotation, strategies for quality control, and possible applications.
Keywords lexical semantics, semantic roles, lexical resource, corpus annotation, FrameNet, German, noncompositionality
Full paper The SALSA Corpus: a German Corpus Resource for Lexical Semantics