LREC 2000 2nd International Conference on Language Resources & Evaluation
 


Title SPEECHDAT-CAR. A Large Speech Database for Automotive Environments
Authors Moreno Asunción (Universitat Politècnica de Catalunya, Jordi Girona 1-3 08034 Barcelona, SPAIN, http://gps-tsc.upc.es/veu, asuncion@tsc.upc.es)
Lindberg Børge (Center for PersonKommunikation (CPK), Aalborg, Denmark)
Draxler Christoph (IPSK of the University of Munich)
Richard Gaël (Lernout & Hauspie, France)
Choukri Khalid (European Language Resources Association (ELRA) & European Language resources - Distribution Agency (ELDA), 55-57, rue Brillat-Savarin, 75013 Paris, France, choukri@elda.fr)
Euler Stephan (Robert Bosch GmbH, Germany)
Allen Jeffrey (European Language Resources Association (ELRA) & European Language resources - Distribution Agency (ELDA), 55-57, rue Brillat-Savarin, 75013 Paris, France, jeff@elda.fr)
Keywords Car environment, GSM signals, Multilingual, Oral Databases, Speech Recognition
Session Session SP3 - Spoken Language Resources' Projects
Full Paper 373.ps, 373.pdf
Abstract The aim of the SpeechDat-Car project is to develop a set of speech databases to support training and testing of multilingual speech recognition applications in the car environment. A total of ten equivalent resources will be created, covering ten languages: Danish, British English, Finnish, Flemish/Dutch, French, German, Greek, Italian, Spanish and American English. For each language, 600 sessions will be recorded (from at least 300 speakers) in seven characteristic environments (low speed, high speed with audio equipment on, etc.). This paper gives an overview of the project with a focus on the production phases (recording platforms, speaker recruitment, annotation and distribution).