ABSTRACT
In the framework of BABEL project titled "A Multilingual Data-base Collection" the task was not only the collection of a large speech corpora, but also the segmentation of continuously spoken paragraphs on phonetic level. Automatic segmentation technique was needed to help our work. There are some tools prepared before for that purposes, but generally those were prepared for one language, and/or based on the excellent but very expensive HTK program.
The aim of our work is to develop such an automatic segmentation system what is useable for many European languages, giving a good help in the segmentation and labelling of the clear speech databases. So different languages in BABEL project and EUROM 1 project were examined and find an optimal solution.
CONCEPTOur concept for automatic labelling is the following: if you know the labels of the examined sentences, and you can automatically segment the acoustically quasi homogenous parts in the examined sentences, the labelling of each single segment is executable.
RESULTS OF THE AUTOMATIC SEGMENTATION
The hand made and the automatic segmentation of the same sentences were compared with each other, and presented at the same time, so these two results are comparable.
Same faces were examined:
Language |
Hungarian (H)
|
German (G)
|
English (E)
|
Bulgarian (B) |
||||
Type of training material |
Hungarian |
Mixed H_E_B |
German |
Mixed H_E_B |
English |
Mixed H_E_B |
Bulgarian |
Mixed H_E_B |
resonant constant |
76 |
78 |
75 |
69 |
83 |
77 |
86 |
85 |
spirant constant |
88 |
87 |
96 |
92 |
95 |
97 |
94 |
96 |
all phonemes |
85 |
86 |
82 |
78 |
83 |
83 |
89 |
89 |
The results were prepared on the training of of the net by 4 paragraph (2 women and 2men, 20 sentences), and examined by 4 paragraph (2 women and 2men, 20 sentences), differing from the trained examples.