Summary of the paper

Title The ETAPE corpus for the evaluation of speech-based TV content processing in the French language
Authors Guillaume Gravier, Gilles Adda, Niklas Paulsson, Matthieu Carré, Aude Giraudel and Olivier Galibert
Abstract The paper presents a comprehensive overview of existing data for the evaluation of spoken content processing in a multimedia framework for the French language. We focus on the ETAPE corpus which will be made publicly available by ELDA mid 2012, after completion of the evaluation campaign, and recall existing resources resulting from previous evaluation campaigns. The ETAPE corpus consists of 30 hours of TV and radio broadcasts, selected to cover a wide variety of topics and speaking styles, emphasizing spontaneous speech and multiple speaker areas.
Topics Speech resource/database, Speech Recognition/Understanding, Named Entity recognition
Full paper The ETAPE corpus for the evaluation of speech-based TV content processing in the French language
Bibtex @InProceedings{GRAVIER12.495,
  author = {Guillaume Gravier and Gilles Adda and Niklas Paulsson and Matthieu Carré and Aude Giraudel and Olivier Galibert},
  title = {The ETAPE corpus for the evaluation of speech-based TV content processing in the French language},
  booktitle = {Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12)},
  year = {2012},
  month = {may},
  date = {23-25},
  address = {Istanbul, Turkey},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Mehmet Uğur Doğan and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {978-2-9517408-7-7},
  language = {english}
 }
Powered by ELDA © 2012 ELDA/ELRA