Summary of the paper

Title HIFI-AV: An Audio-visual Corpus for Spoken Language Human-Machine Dialogue Research in Spanish
Authors Fernando Fernández-Martínez, Juan Manuel Lucas-Cuesta, Roberto Barra Chicote, Javier Ferreiros and Javier Macías-Guarasa
Abstract In this paper, we describe a new multi-purpose audio-visual database in the context of speech interfaces for controlling household electronic devices. The database comprises speech and video recordings of 19 speakers interacting with a HIFI audio box by means of a spoken dialogue system. Dialogue management is based on Bayesian Networks, and the system is provided with contextual information handling strategies. Each speaker was asked to fulfil different sets of specific goals following predefined scenarios, which varied in both complexity level and the degree of freedom or initiative allowed to the user. Owing to its careful design and its size, the recorded database allows comprehensive studies on speech recognition, speech understanding, dialogue modeling and management, microphone-array-based speech processing, and both speech- and video-based acoustic source localisation. The database has been labelled for quality and efficiency studies on dialogue performance. The whole database has been validated through both objective and subjective tests.
Topics Corpus (creation, annotation, etc.), Dialogue, Voice Command and Control
Bibtex @InProceedings{FERNNDEZMARTNEZ10.230,
  author = {Fernando Fernández-Martínez and Juan Manuel Lucas-Cuesta and Roberto Barra Chicote and Javier Ferreiros and Javier Macías-Guarasa},
  title = {HIFI-AV: An Audio-visual Corpus for Spoken Language Human-Machine Dialogue Research in Spanish},
  booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }