Summary of the paper

Title The Development of the Multilingual LUNA Corpus for Spoken Language System Porting
Authors Evgeny Stepanov, Giuseppe Riccardi and Ali Orkan Bayer
Abstract The development of annotated corpora is a critical process in the development of speech applications for multiple target languages. While the technology to develop a monolingual speech application has reached satisfactory results (in terms of performance and effort), porting an existing application from a source language to a target language is still a very expensive task. In this paper we address the problem of creating multilingual aligned corpora and its evaluation in the context of a spoken language understanding (SLU) porting task. We discuss the challenges of the manual creation of multilingual corpora, as well as present the algorithms for the creation of multilingual SLU via Statistical Machine Translation (SMT).
Topics Dialogue, Multilinguality
Full paper The Development of the Multilingual LUNA Corpus for Spoken Language System Porting
Bibtex @InProceedings{STEPANOV14.789,
  author = {Evgeny Stepanov and Giuseppe Riccardi and Ali Orkan Bayer},
  title = {The Development of the Multilingual LUNA Corpus for Spoken Language System Porting},
  booktitle = {Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)},
  year = {2014},
  month = {may},
  date = {26-31},
  address = {Reykjavik, Iceland},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Hrafn Loftsson and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {978-2-9517408-8-4},
  language = {english}
 }
Powered by ELDA © 2014 ELDA/ELRA