Title :
Automatic speech synthesis unit generation with MLP based postprocessor against auto-segmented phoneme errors
Author :
Park, Eun-Young ; Kim, Sang-Hun ; Chung, Jae-Ho
Author_Institution :
Dept. of Electron. Eng., Inha Univ., Inchon, South Korea
Abstract :
This work presents a postprocessor that improves the performance of an automatic speech segmentation system by correcting phoneme boundary errors. The proposed postprocessor reduces the range of errors in the auto-labeled results, which are intended to be used directly as synthesis units. Starting from a baseline automatic segmentation system, the postprocessor learns from features of the hand-labeled results using a multi-layer perceptron (MLP). The auto-labeled result is then combined with the MLP postprocessor to determine a new phoneme boundary. For phonetically rich sentences, we achieved a 19.9% improvement in frame accuracy compared with a conventional automatic labeling system, and we reduced the absolute error by about 28.6%.
Keywords :
feature extraction; learning (artificial intelligence); multilayer perceptrons; speech synthesis; auto labeled result; automatic speech segmentation system; automatic speech synthesis unit generation; frame accuracy; hand labeled results; phoneme boundary errors; phonetically rich sentences; Databases; Error correction; Frequency; Labeling; Multilayer perceptrons; Signal processing; Signal processing algorithms; Signal synthesis; Speech processing; Speech synthesis;
Conference_Title :
Neural Networks, 1999. IJCNN '99. International Joint Conference on
Conference_Location :
Washington, DC
Print_ISBN :
0-7803-5529-6
DOI :
10.1109/IJCNN.1999.835996