LREC 2000 2nd International Conference on Language Resources & Evaluation |
Title | SALA: SpeechDat across Latin America. Results of the First Phase |
Authors | Moreno Asunción (Universitat Politècnica de Catalunya, Jordi Girona 1-3 08034 Barcelona, SPAIN, http://gps-tsc.upc.es/veu, asuncion@tsc.upc.es) Comeyne Robrecht (Lernout & Hauspie, Ieper, Belgium) Haslam Keith (Vocalis, Cambridge, UK) van den Heuvel Henk (SPEX, Nijmegen, Netherlands, e-mail: H.v.d.Heuvel@let.kun.nl) Höge Harald (Siemens AG, München, Germany) Horbach Sabine (Philips, Aachen, Germany, CSELT, Torino, Italy) Micca Giorgio (CSELT, Via G. Reiss Romoli 274, 10148 Torino, Italia, giorgio.micca@cselt.it) |
Keywords | Latin America, Oral Databases, Spanish and Portuguese, Speech Recognition, Telephone Speech |
Session | Session SP3 - Spoken Language Resources' Projects |
Full Paper | 10.ps, 10.pdf |
Abstract | The objective of the SALA (SpeechDat across Latin America) project is to record large SpeechDat-like databases to train telephone speech recognisers for any country in Latin America. The SALA consortium is composed of several European companies (CSELT, Italy; Lernout & Hauspie, Belgium; Philips, Germany; Siemens AG, Germany; Vocalis, U.K.) and universities (UPC, Spain; SPEX, The Netherlands). This paper gives an overview of the project, introduces the definition of the databases, shows the dialectal distribution in the countries where recordings take place, and reports on validation issues, the current status of the project, and practical experience in recruiting speakers for and annotating such large databases in Latin America. |