Summary of the paper

Title Acquiring Pronunciation Data for a Placenames Lexicon in a Less-Resourced Language
Authors Briony Williams and Rhys James Jones
Abstract A new procedure is described for generating pronunciations for a dictionary of place-names in a less-resourced language (Welsh, spoken in Wales, UK). The method is suitable for use in a situation where there is a lack of skilled phoneticians with expertise in the language, but where there are native speakers available, as well as a text-to-speech synthesiser for the language. The lack of skilled phoneticians will make it impossible to carry out direct editing of pronunciations, and so a method has been devised that makes it possible for non-phonetician native speakers to edit pronunciations without knowledge of the phonology of the language. The key advance in this method is the use of “re-spelling” to indicate pronunciation in a linguistically-naïve fashion on the part of the non-specialist native speaker. The “re-spelled” forms of placenames are used to drive a set of specially-adapted letter-to-sound rules, which generate the pronunciations desired. The speech synthesiser is used to provide audio feedback to the native speaker editor for purposes of verification. A graphical user interface acts as the link between the database, the speech synthesiser and the native speaker editor. This method has been used successfully to generate pronunciations for placenames in Wales.
Language Multiple languages
Topics Lexicon, lexical database, Speech synthesis, Text-to-speech systems, Tools, systems, applications
Full paper Acquiring Pronunciation Data for a Placenames Lexicon in a Less-Resourced Language
Slides -
Bibtex @InProceedings{WILLIAMS08.55,
  author = {Briony Williams and Rhys James Jones},
  title = {Acquiring Pronunciation Data for a Placenames Lexicon in a Less-Resourced Language},
  booktitle = {Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)},
  year = {2008},
  month = {may},
  date = {28-30},
  address = {Marrakech, Morocco},
  editor = {Nicoletta Calzolari and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-4-0},
  note = {http://www.lrec-conf.org/proceedings/lrec2008/},
  language = {english}
  }
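
The re-spelling workflow described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration, not the authors' implementation: the rule table is a small toy subset of Welsh letter-to-sound correspondences, `PlacenameEntry` and `letter_to_sound` are hypothetical names, the re-spelling "Bari" for "Barry" is an invented example, and the audio feedback from the speech synthesiser is not modelled.

```python
from dataclasses import dataclass
from typing import List, Optional

# Toy letter-to-sound table (assumption): a handful of Welsh
# grapheme-to-phoneme correspondences, NOT the rule set from the paper.
LTS_RULES = {
    "ll": "ɬ", "dd": "ð", "ch": "χ", "ff": "f", "th": "θ",
    "f": "v", "a": "a", "e": "e", "i": "i", "o": "o",
    "b": "b", "d": "d", "g": "g", "l": "l", "m": "m",
    "n": "n", "r": "r", "s": "s", "t": "t",
}


def letter_to_sound(spelling: str) -> List[str]:
    """Convert a spelling to phonemes by greedy longest-match lookup."""
    text = spelling.lower()
    phonemes = []
    i = 0
    while i < len(text):
        for size in (2, 1):  # try digraphs before single letters
            chunk = text[i:i + size]
            if chunk in LTS_RULES:
                phonemes.append(LTS_RULES[chunk])
                i += len(chunk)
                break
        else:
            phonemes.append("?")  # letter not covered by the toy rules
            i += 1
    return phonemes


@dataclass
class PlacenameEntry:
    """Hypothetical lexicon record: a name plus an optional re-spelling."""
    name: str
    respelling: Optional[str] = None  # supplied by a native-speaker editor

    def pronunciation(self) -> List[str]:
        # The re-spelled form, when present, drives the rules instead
        # of the standard orthography.
        return letter_to_sound(self.respelling or self.name)


if __name__ == "__main__":
    print(PlacenameEntry("Llanelli").pronunciation())
    print(PlacenameEntry("Barry").pronunciation())
    print(PlacenameEntry("Barry", respelling="Bari").pronunciation())
```

In this sketch the editor never touches phoneme symbols: a name whose standard spelling the rules cannot handle is corrected by typing an alternative spelling, and the same rules are re-run on that form.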
