Summary of the paper

Title The AUTONOMATA Spoken Names Corpus
Authors Henk van den Heuvel, Jean-Pierre Martens, Bart D’hoore, Kristof D’hanens, Nanneke Konings
Abstract In the Autonomata project we have collected a corpus of spoken name utterances with manually corrected phonemic transcriptions of these utterances. The corpus was designed with the intention to become a major resource for the development of automatic speech recognition engines that can achieve a high accuracy on the recognition of person and geographical names spoken in Dutch. The recorded names were selected so as to reveal the major pronunciation variations that a speech recognizer of e.g. a navigation system with speech input is going to be confronted with. This includes native speakers speaking foreign names and vice versa.
Language Multiple languages
Topics Corpus (creation, annotation, etc.), Speech resource/database, Multilinguality
Full paper The AUTONOMATA Spoken Names Corpus
Slides -
Bibtex @InProceedings{VANDENHEUVEL08.48,
  author = {Henk van den Heuvel, Jean-Pierre Martens, Bart D’hoore, Kristof D’hanens, Nanneke Konings},
  title = {The AUTONOMATA Spoken Names Corpus},
  booktitle = {Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)},
  year = {2008},
  month = {may},
  date = {28-30},
  address = {Marrakech, Morocco},
  editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-4-0},
  note = {http://www.lrec-conf.org/proceedings/lrec2008/},
  language = {english}
  }

Powered by ELDA © 2008 ELDA/ELRA