Summary of the paper

Title An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus
Authors Stergos Afantenos, Nicholas Asher, Farah Benamara, Myriam Bras, Cecile Fabre, Mai Ho-Dac, Anne Le Draoulec, Philippe Muller, Marie-Paul Pery-Woodley, Laurent Prevot, Josette Rebeyrolles, Ludovic Tanguy, Marianne Vergez-Couret and Laure Vieu
Abstract This paper describes the ANNODIS resource, a discourse-level annotated corpus for French. The corpus combines two perspectives on discourse: a bottom-up approach and a top-down approach. The bottom-up view incrementally builds a structure from elementary discourse units, while the top-down view focuses on the selective annotation of multi-level discourse structures. The corpus is composed of texts that are diversified with respect to genre, length and type of discursive organisation. The methodology followed here involves an iterative design of annotation guidelines in order to reach satisfactory inter-annotator agreement levels. This allows us to raise a few issues relevant for the comparison of such complex objects as discourse structures. The corpus also serves as a source of empirical evidence for discourse theories. We present here two first analyses taking advantage of this new annotated corpus --one that tested hypotheses on constraints governing discourse structure, and another that studied the variations in composition and signalling of multi-level discourse structures.
Topics Discourse annotation, representation and processing
Full paper An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus
Bibtex @InProceedings{AFANTENOS12.836,
  author = {Stergos Afantenos and Nicholas Asher and Farah Benamara and Myriam Bras and Cecile Fabre and Mai Ho-Dac and Anne Le Draoulec and Philippe Muller and Marie-Paul Pery-Woodley and Laurent Prevot and Josette Rebeyrolles and Ludovic Tanguy and Marianne Vergez-Couret and Laure Vieu},
  title = {An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus},
  booktitle = {Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12)},
  year = {2012},
  month = {may},
  date = {23-25},
  address = {Istanbul, Turkey},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Mehmet Uğur Doğan and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {978-2-9517408-7-7},
  language = {english}
 }
Powered by ELDA © 2012 ELDA/ELRA