LREC 2000 2nd International Conference on Language Resources & Evaluation | ||||||
Title | EULER: an Open, Generic, Multilingual and Multi-platform Text-to-Speech System |
Authors | Dutoit Thierry (Faculté Polytechnique de Mons, Circuits Theory and Signal Processing Lab, Bâtiment Multitel, Parc Initialis, av. Copernic, B7000 Mons, BELGIUM, Tel: +32 65 374733 Fax: +32 65 374729, Web: http://tcts.fpms.ac.be, Email: fdutoit@tcts.fpms.ac.be) Bagein Michel (Faculté Polytechnique de Mons, Circuits Theory and Signal Processing Lab, Bâtiment Multitel, Parc Initialis, av. Copernic, B7000 Mons, BELGIUM, Tel: +32 65 374733 Fax: +32 65 374729, Web: http://tcts.fpms.ac.be, Email: bagein@tcts.fpms.ac.be) Malfrère Fabrice (Faculté Polytechnique de Mons, Circuits Theory and Signal Processing Lab, Bâtiment Multitel, Parc Initialis, av. Copernic, B7000 Mons, BELGIUM, Tel: +32 65 374733 Fax: +32 65 374729, Web: http://tcts.fpms.ac.be, Email: bagein@tcts.fpms.ac.be) Pagel Vincent (Faculté Polytechnique de Mons, Circuits Theory and Signal Processing Lab, Bâtiment Multitel, Parc Initialis, av. Copernic, B7000 Mons, BELGIUM, Tel: +32 65 374733 Fax: +32 65 374729, Web: http://tcts.fpms.ac.be, Email: pagel@tcts.fpms.ac.be) Ruelle Alain (Faculté Polytechnique de Mons, Circuits Theory and Signal Processing Lab, Bâtiment Multitel, Parc Initialis, av. Copernic, B7000 Mons, BELGIUM, Tel: +32 65 374733 Fax: +32 65 374729, Web: http://tcts.fpms.ac.be, Email: ruelle@tcts.fpms.ac.be) Tounsi Nawfal (Faculté Polytechnique de Mons, Circuits Theory and Signal Processing Lab, Bâtiment Multitel, Parc Initialis, av. Copernic, B7000 Mons, BELGIUM, Tel: +32 65 374733 Fax: +32 65 374729, Web: http://tcts.fpms.ac.be, Email: tounsig@tcts.fpms.ac.be) Wynsberghe Dominique (Faculté Polytechnique de Mons, Circuits Theory and Signal Processing Lab, Bâtiment Multitel, Parc Initialis, av. Copernic, B7000 Mons, BELGIUM, Tel: +32 65 374733 Fax: +32 65 374729, Web: http://tcts.fpms.ac.be, Email: tounsig@tcts.fpms.ac.be) |
Keywords | |
Session | Session SO3 - Speech Synthesis |
Full Paper | 41.ps, 41.pdf |
Abstract | The aim of the collaborative project presented in this paper is to obtain a set of highly modular Text-To-Speech synthesizers for as many voices, languages and dialects as possible, free for use in non-commercial and non-military applications. This project is an extension of the MBROLA project: MBROLA is a speech synthesizer, freely distributed for non-commercial purposes, which uses diphone databases provided by users (19 languages in year 2000). Euler extends this idea to whole TTS systems by providing a backbone structure (MLC) and several generic algorithms for POS tagging, grapheme-to-phoneme conversion, and prosody generation. To demonstrate the potentials of the architecture and draw developpers’ interest we provide a full EULER-based TTS in French and in Arabic. Euler currently runs on Windows and Linux, and it is an open project: many of its components (and certainly its kernel) are provided as GNU C++ sources. It also incorporates, as much as possible, components and data derived from other TTS-related projects. |