Title |
The AUTONOMATA Spoken Names Corpus |
Authors |
Henk van den Heuvel, Jean-Pierre Martens, Bart Dhoore, Kristof Dhanens, Nanneke Konings |
Abstract |
In the Autonomata project we have collected a corpus of spoken name utterances with manually corrected phonemic transcriptions of these utterances. The corpus was designed with the intention to become a major resource for the development of automatic speech recognition engines that can achieve a high accuracy on the recognition of person and geographical names spoken in Dutch. The recorded names were selected so as to reveal the major pronunciation variations that a speech recognizer of e.g. a navigation system with speech input is going to be confronted with. This includes native speakers speaking foreign names and vice versa. |
Language |
Multiple languages |
Topics |
Corpus (creation, annotation, etc.), Speech resource/database, Multilinguality |
Full paper |
The AUTONOMATA Spoken Names Corpus |
Slides |
- |
Bibtex |
@InProceedings{VANDENHEUVEL08.48,
author = {Henk van den Heuvel, Jean-Pierre Martens, Bart Dhoore, Kristof Dhanens, Nanneke Konings},
title = {The AUTONOMATA Spoken Names Corpus},
booktitle = {Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)},
year = {2008},
month = {may},
date = {28-30},
address = {Marrakech, Morocco},
editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Daniel Tapias},
publisher = {European Language Resources Association (ELRA)},
isbn = {2-9517408-4-0},
note = {http://www.lrec-conf.org/proceedings/lrec2008/},
language = {english}
} |