Title |
ADAM: The SI-TAL Corpus of Annotated Dialogues |
Authors |
Roldano Cattoni (itc-IRST, Trento, Italy) Morena Danieli (Loquendo S.p.A., Torino, Italy) Vanessa Sandrini (itc-IRST, Trento, Italy) Claudia Soria (ILC-CNR, Pisa, Italy) |
Session |
SO3: Dialogue-Conversation Evaluation |
Abstract |
In this paper we describe the methodological assumptions, general architectural framework and annotation and encoding practices underlying the ADAM Corpus, which has been developed as part of the Italian national project SI-TAL. Each of the 450 dialogues is represented by an orthographic transcription and is annotated at five levels of linguistic information, namely prosody, pos tagging, syntax, semantics, and pragmatics. |
Keywords |
Spoken dialogue corpus, Multilevel annotation, Standoff markup, Annotation standards, Validation |
Full Paper |