Automatic speech synthesis unit generation with MLP based postprocessor against auto-segmented phoneme errors

Author

Park, Eun-Young ; Kim, Sang-Hun ; Chung, Jae-Ho

Author_Institution

Dept. of Electron. Eng., Inha Univ., Inchon, South Korea

Volume

5

fYear

1999

fDate

1999

Firstpage

2985

Abstract

This work presents a postprocessor that improves the performance of an automatic speech segmentation system by correcting phoneme boundary errors. The proposed postprocessor reduces the range of errors in the auto-labeled results so that they can be used directly as synthesis units. Starting from a baseline automatic segmentation system, the postprocessor is trained on features of hand-labeled results using a multi-layer perceptron (MLP). The auto-labeled result is then combined with the MLP postprocessor to determine a new phoneme boundary. For phonetically rich sentences, we achieved a 19.9% improvement in frame accuracy compared with a conventional automatic labeling system, and reduced the absolute error by about 28.6%.

Keywords

feature extraction; learning (artificial intelligence); multilayer perceptrons; speech synthesis; auto labeled result; automatic speech segmentation system; automatic speech synthesis unit generation; frame accuracy; hand labeled results; phoneme boundary errors; phonetically rich sentences; Databases; Error correction; Frequency; Labeling; Multilayer perceptrons; Signal processing; Signal processing algorithms; Signal synthesis; Speech processing; Speech synthesis;

fLanguage

English

Publisher

IEEE

Conference_Titel

Neural Networks, 1999. IJCNN '99. International Joint Conference on

Conference_Location

Washington, DC

ISSN

1098-7576

Print_ISBN

0-7803-5529-6

Type

conf

DOI

10.1109/IJCNN.1999.835996

Filename

835996